Data Scientist LLM (Large Language Model) Applications

Hong Kong 7 days agoFull-time External
445k - 623k / yr
Position Overview We are looking for a Data Scientist / AI Algorithm Engineer with strong expertise in Large Language Models (LLMs) and their applications to join our AI team. As a Data Scientist at PowerArena, you will work on projects about smart factory and smart city, and tackle a diverse range of challenges. In this role, you will design, develop, and deploy state-of-the-art LLM-based solutions for real-world industrial applications, focusing on areas like knowledge retrieval, and AI agents. You will work closely with product, engineering, and research teams to bring cutting-edge LLM and Generative AI models from prototype to production. Key Responsibilities • Research, design, and implement LLM and Generative AI applications for industrial solutions. • Develop and optimize LLM application pipelines, including data preparation, prompt engineering, inference, and response post-processing. • Design and build robust LLM-based agents, including tool-use capabilities, planning, and memory modules. • Develop and manage Knowledge Base and Retrieval-Augmented Generation (RAG) systems for domain-specific information retrieval. • Collaborate with deployment and software engineers to integrate models into scalable, low-latency systems. • Collaborate with area experts to define the scoping of each project / function. • Collect, curate, and preprocess large-scale text datasets for fine-tuning and knowledge base population. • Benchmark and evaluate application performance; conduct ablation studies and drive improvements in accuracy, relevance, and robustness. • Stay up to date with the latest advances in LLMs, Generative AI, and context engineering, and apply them to solve business problems. • Share technical knowledge with team members. Qualifications • Master’s or Ph.D. in Computer Science, Electrical Engineering, or a related field (or equivalent practical experience). • Proven experience in developing and deploying applications using Large Language Models (LLMs). • Essential abilities in prompting, context engineering, and fine-tuning techniques. • Experience in developing LLM-based agents and knowledge base/RAG systems. • Familiarity with model efficiency techniques like LoRA (Low-Rank Adaptation) and knowledge distillation. • Strong proficiency in programming languages such as Python and familiarity with relevant libraries (e.g., PyTorch, TensorFlow, Hugging Face Transformers). • Excellent problem-solving skills and the ability to work independently and as part of a team. Bonus Qualifications • Familiarity with MLOps, model monitoring, and pipeline automation for LLMs. • Familiarity with vision models and computer vision solutions. • Background in smart factory, smart city, or industrial AI use cases. • Contributions to open-source LLM or AI agent projects. • Familiarity with Industry 4.0 applications. What We Offer • Flexible working environment • Competitive salary and benefits package. • Opportunity to work on cutting-edge projects with real-world impact. • Collaborative and innovative work environment. • Professional development and training opportunities.