Description:
Role: Data Engineer - Artificial Intelligence & Machine Learning
Location Options: Bay Area - CA
Responsibilities: -
1. Develop AI/ML Models:
• Design, build, and train machine learning models using appropriate algorithms (e.g., supervised, unsupervised, reinforcement learning, deep learning).
• Use various machine learning and AI frameworks (TensorFlow, PyTorch, Scikit-learn, Keras, etc.) to implement models.
• Experiment with different approaches (e.g., decision trees, neural networks, ensemble methods) and optimize for the best performance.
• Perform model selection, training, tuning, and validation using real-world data to achieve the most accurate results.
2. Data Preparation & Feature Engineering:
• Clean, preprocess, and structure raw data for analysis, ensuring it is suitable for model training.
• Implement data augmentation techniques, handle missing data, and remove outliers.
• Engineer features that will improve model performance, understanding the data's underlying relationships.
3. Algorithm Design and Optimization:
• Develop and optimize algorithms for specific use cases like image recognition, natural language processing (NLP), speech recognition, or recommendation systems.
• Optimize algorithms for high efficiency, scalability, and real-time performance.
• Regularly assess and improve model accuracy by experimenting with various hyperparameters, architectures, and optimization techniques.
4. Deploy Machine Learning Models:
• Collaborate with software engineers to deploy machine learning models into production environments, integrating them with existing systems.
• Ensure the models are scalable, performant, and able to handle real-time or batch data as required.
• Implement model monitoring and performance tracking tools to evaluate accuracy and detect any model drift over time.
5. Model Evaluation & Testing:
• Use cross-validation and other techniques to evaluate the model's generalization capabilities.
• Implement performance metrics to measure model accuracy, precision, recall, F1-score, and other relevant metrics based on project needs.
• Perform A/B testing and compare the performance of multiple models.
6. Continuous Improvement & Research:
• Stay up-to-date with the latest AI/ML research and advancements in the field, such as new algorithms, architectures, and technologies.
• Participate in code reviews and contribute to best practices in AI/ML development.
• Experiment with new AI and machine learning techniques to continually improve performance and solve complex problems.
7. Collaboration & Communication:
• Work closely with Data Scientists, Software Engineers, Product Managers, and other stakeholders to understand business problems and translate them into machine learning tasks.
• Communicate findings, insights, and progress to non-technical stakeholders in a clear, understandable manner.
• Collaborate on projects, providing expertise on AI/ML concepts to help shape product features or solutions.
8. Ethical Considerations and Bias Mitigation:
• Ensure that the models and algorithms are free from biases and ethically sound, particularly when dealing with sensitive data.
• Evaluate fairness, transparency, and interpretability of models, especially in critical applications like healthcare, finance, and legal sectors.
9. Documentation:
• Document the model-building process, algorithm choice, and data used, ensuring reproducibility and transparency.
• Write clear technical documentation and user guides to facilitate collaboration and knowledge transfer.
10. Innovation and Prototyping:
• Prototype AI-driven solutions to demonstrate their potential and feasibility.
• Develop proof of concepts (PoCs) and new algorithms for emerging AI and ML technologies (e.g., federated learning, reinforcement learning, generative models)
Qualifications:
1.Educational Background:
Bachelor's or Master's degree in Computer Science, Data Science, Artificial Intelligence, Mathematics, or a related field.
Ph.D. in a relevant field is a plus but not required
2.Technical Skills:
• Strong programming skills in Python, R, or similar languages for machine learning and data analysis.
• Deep knowledge of machine learning libraries such as TensorFlow, PyTorch, Scikit-learn, Keras, and XGBoost.
• Strong foundation in linear algebra, probability, statistics, and optimization techniques.
• Proficiency in algorithms and data structures.
• Experience with deep learning techniques, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), transformers, and reinforcement learning.
• Familiarity with NLP techniques, computer vision, time series analysis, and other AI sub-domains.
• Knowledge of data preprocessing, feature extraction, and feature selection techniques.
• Proficiency in cloud platforms (AWS, Azure, Google Cloud) for model training and deployment.
• Experience with containerization and orchestration tools (e.g., Docker, Kubernetes) is a plus.
• Familiarity with big data technologies (Hadoop, Spark, etc.) is a plus.
3.Soft Skills:
• Strong problem-solving and analytical skills.
• Ability to work in a collaborative, cross-functional environment.
• Effective communication skills to explain complex AI/ML concepts to non-technical stakeholders.
• Strong attention to detail and ability to troubleshoot and debug code.
• Passion for continuous learning and staying up-to-date with AI and ML advancements.
4.Experience:
• Proven experience (3+ years) in developing and deploying machine learning or AI models in a production environment.
• Familiarity with MLOps (machine learning operations) principles, such as model versioning, CI/CD pipelines for ML, and model monitoring in production
5.Preferred Qualifications:
• Experience with reinforcement learning, unsupervised learning, or generative models (e.g., GANs).
• Knowledge of ethics in AI, such as mitigating bias and ensuring fairness in models.
Familiarity with NLP libraries like SpaCy, NLTK, Hugging Face Transformers, etc.
• Experience in building and deploying AI-powered products in a commercial setting.
• Knowledge of edge computing and deploying AI models on edge devices
6.Work Environment:
• Collaborative and fast-paced work environment.
• Opportunity to work with state-of-the-art technologies.
• Supportive and dynamic team culture
• The position may require collaborating across multiple teams, including product, engineering, and research groups, to develop and implement AI/ML solutions
#LI-AD1