A Series C startup focused on AI-powered customer solutions is looking for a full-time Machine Learning Engineer to join their growing team. They are building an agentic voice platform that would enable patients to have real-time conversations with AI for healthcare in which the Machine Learning Engineer will be creating optimized and domain-specific conversational AI by fine-tuning LLM’s using LoRA, QLoRA and DPO.
This role requires extensive experience with fine tuning LLM’s and pushing LLMs to be deployed at a large scale. This opportunity is perfect for someone that is passionate in technology with impact, wanting to push to limits of AI, and joining a curious and fast-paced team.
Required Skills & Experience
• 4+ years of developing Machine Learning systems from research to production
• 2+ years in conversational AI
• Experience with fine-tuning and compression
• Experience with Python, Pytorc, LoRA, QLoRA, Langchain, LangGraph, Kafka, Redis, Postgres, Kubernetes, MLflow/SageMaker)
Desired Skills & Experience
• Leadership experience such as defining ML best practices and mentoring junior engineers.
• Experience with ML research such as running POCs with models to stay ahead in the latest LLM research
• Experience with scale inference
The Offer
• Hybrid 3 days/week in the West Loop
• Competitive Salary
• Medical, Dental, Vision Benefits
• PTO
• Stock options + bonus
Applicants must be currently authorized to work in the United States on a full-time basis now and in the future.
#LI-1