Job description:
Key Responsibilities:
• Design, build, and fine-tune Generative AI models (LLMs, diffusion models, transformers) for enterprise use cases.
• Implement end-to-end ML pipelines — data ingestion, preprocessing, model training, and deployment.
• Collaborate with data engineering and cloud teams to operationalize models on AWS or other cloud platforms.
• Experiment with prompt engineering, embeddings, and model customization for domain-specific applications.
• Develop APIs and microservices to serve AI/ML models in production environments.
• Optimize model performance, cost, and scalability through fine-tuning, quantization, or caching strategies.
• Stay current with emerging AI frameworks, GenAI tools, and open-source LLMs (e.g., OpenAI, Hugging Face, LangChain).