Job Description:
• Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates
• Build offline and live eval pipelines for alignment, factuality, grounding, and hallucinations
The role requires experience in distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, mixed precision, and performance tuning on vLLM or Triton clusters.
Familiarity with model alignment, JSON-schema function calls, and external tool integration is also required.