Data Engineering team is responsible for designing building and maintaining the Data Lake infrastructure including ingestion pipelines storage systems and internal tooling for reliable scalable access to market data.
Key Responsibilities:
• Ingestion & Pipelines: Architect batch stream pipelines (Airflow Kafka dbt) for diverse structured and unstructured marked data. Provide reusable SDKs in Python and Go for internal data producers.
• Storage & Modeling: Implement and tune S3 column-oriented and time-series data storage for petabyte-scale analytics; own partitioning compression TTL versioning and cost optimisation.
• Tooling & Libraries: Develop internal libraries for schema management data contracts validation and lineage; contribute to shared libraries and services for internal data consumers for research backtesting and real-time trading purposes.
• Reliability & Observability : Embed monitoring alerting SLAs SLOs and CI/CD; champion automated testing data quality dashboards and incident runbooks.
• Collaboration: Partner with Data Science Quant Research Backend and DevOps to translate requirements into platform capabilities and evangelise best practices.
Qualifications :
• 6 years of experience building and maintaining production-grade data systems with proven expertise in architecting and launching data lakes from scratch.
• Expert-level Python development skills (Go and C nice to have).
• Hands-on experience with modern orchestration tools (Airflow) and streaming platforms (Kafka).
• Advanced SQL skills including complex aggregations window functions query optimisation and indexing.
• Experience designing high-throughput APIs (REST/gRPC) and data access libraries.
• Solid fundamentals in Linux containerisation (Docker) and cloud object storage solutions (AWS S3 GCS).
• Strong knowledge of handling diverse data formats including structured and unstructured data with experience optimising storage strategies such as partitioning compression and cost management.
• English at C1 level - confident communication documentation and collaboration within an international team.
Additional Information :
• We work remotely from anywhere in the world with a flexible schedule.
• We offer compensation for health insurance sports activities and professional training.