A company is looking for a Software Engineer L4 / L5, Model Serving Systems, Machine Learning Platform.
Key Responsibilities
Develop and expand compute infrastructure to support AI needs and drive ML / AI innovation
Build scalable and robust model serving systems for ML applications, including LLMs
Collaborate cross-functionally with engineers, product managers, and data scientists
Required Qualifications
Experience in building high-traffic distributed services and infrastructure for online ML model inference
Understanding of scalable model-serving solutions for generative models and LLMs
Proficiency in object-oriented programming, preferably in Java
Familiarity with deploying ML models using tools like Triton Inference Server and Docker
BS / MS in Computer Science, Applied Math, Engineering, or a related field