Data Infrastructure Engineer
Deep Instinct
Description
Deep Instinct, the first cyber security company to apply Deep Learning to cyber security is an innovative start-up that has a unique and game-changing software solution to protect against Zero Day & APT cyber-attacks. This is an incredible opportunity to get in early at a Pre-IPO Cyber Security company that is poised to do huge things! We're on a mission to disrupt the cyber security market and the time is NOW!
In the past 7 years we’ve faced, and continue facing, the challenges of bringing Deep-Learning technologies and performance-driven software to a single solution.
Why work with us:
At Deep Instinct we are committed to creating excellent employee experience. We offer competitive salaries, a generous benefits package among great culture. We have some of the most forward-thinking and hardworking people in the world working for us. What are we looking for?
We’re looking for an additional player to join our agentless team that would bring the world the cyber security solution it needs. Someone who is a savvy developer, aspiring to spearhead the design and development of a cutting-edge malware detection service, all while having fun throughout the day. Someone who isn’t afraid to learn on their own while expanding the vast knowledge currently existing in the company, all in the pursue to help protect the world’s data.
Responsibilities:
As a data infrastructure engineer in Deep Instinct, you'll be at the forefront of data and ML infrastructure development. On a daily basis, you will design, develop, and maintain the systems that allow for the seamless flow, quality, and reliability of our data and deep learning models. You will be part of a cross-functional team in close cooperation with Algo and Cyber Security teams and answer their needs using cutting-edge technologies.
Responsibilities
Develop creative and robust software solutions to automate the operation of large distributed data processing systems.
Help build and support internal and external frameworks like spark, pytorch, etc to interact with various cloud technologies.
Designing and implementing complex ETL pipelines.
Collaborating with data scientists to develop new ML Ops processes.
Keep close and overall monitoring for all the deployments of the systems, maintain the system’s stability, improve the performance, the performance bottlenecks, track and troubleshoot, cost optimization.
Requirements
Minimum 3 years of programming experience with Python in a collaborative CI/CD software development environment, including git, code review, easily maintained, scalable, and documented code.
Data processing experience with ETLs and SQL data bases (experience with pipelines).
Experience with cloud environment management.
Experience with distributed data systems such as Hadoop, Elasticsearch, etc and their performance optimization.
Working with systems like Docker and Kubernetes.
Bachelor's degree in Computer Science, Software Engineering or equivalent practical experience.