Job description
Numerator is looking for a passionate Engineer to join our growing Research and Development team.
This is a unique opportunity where you will get a chance to work with an established and rapidly evolving platform that handles millions of requests and massive amounts of events, and other data.
In this position, you will be responsible for taking on new initiatives to design, build, deploy, and support high performance deep learning systems in a rapidly-scaling environment.
As a member of our team, you will make an immediate impact as you help build out and expand our technology platforms across several software products.
This is a high growth and impact role that will give you tons of opportunity to drive decisions for projects from inception through production.
What You'll Do:
+ Help to create and build out our API framework that enables our machine learning team to expose their models to engineering teams across the organization.
+ Orchestrate systems using Kubernetes.
+ Set up CI/CD pipelines of multiple applications.
+ Scale multiple APIs to handle millions of requests per day using both synchronous and asynchronous webhook callback approaches.
+ Work with deep learning engineers and DevOps teams to troubleshoot server/infrastructure/systems-level issues.
+ Assist in data gathering and enhancing and building out visualizations to help QC model output.
+ Maintaining the system in general, on-call bug-fixing for mission critical issues.
+ Work on and build out tools that simplify package upgrades across many microserves
+ 3+ years experience deploying robust APIs in production environments (ideally cloud-based environments such as AWS or GCP).
+ Knowledge of deploying and troubleshooting containers (Docker) and container orchestration systems (Kubernetes).
+ 2+ years Experience with python and web frameworks like django, flask or FastAPI.
+ An understanding of the importance of CI/CD pipelines.
+ You look ahead to identify opportunities and foster a culture of innovation.
+ BS in Computer Science or a related field, or equivalent work experience.
Nice to Haves:
+ Knowledge of Terraform or other infrastructure as code solutions.
+ Experience working with Amazon web services or another cloud provider.
+ Experience deploying and working on Kubernetes clusters on EKS.
+ Knowledge of monitoring, tracing, and metrics visualization (e.g. Prometheus, Sentry, Fluentd, Grafana, OpenTracing,/Jaeger, etc)
+ Production experience with deep learning systems (e.g. inference optimization, GPUs)
What We Offer:
+ An inclusive and collaborative company culture
+ An opportunity to have an impact on our growing teams and organization
+ Ownership of platforms and environments with industry leading products
+ Market competitive total compensation package
+ Volunteer time off and charitable donation matching
+ Strong support for career growth, including mentorship programs, leadership training, and access to employee resource groups
+ Regular hackathons to build your own projects and work with people across the entire company
+ Excellent benefits package including medical/vision/dental insurance, unlimited PTO, and 401k plus matching.
Required Skill Profession
Other General