We are looking for a Data Engineer to manage and optimize our data pipeline for machine learning purposes. The ideal candidate will be responsible for ensuring data quality, availability, and efficient processing to support the development and deployment of machine learning models.
As a Data Engineer, you will be responsible to:
• Design, implement, and maintain data pipelines to collect, process, and store large datasets.
• Ensure data quality and integrity through rigorous validation and cleaning processes.
• Collaborate with machine learning engineers to prepare datasets for model training and evaluation.
• Manage data storage solutions, including relational and NoSQL databases.
• Optimize data workflows for performance and scalability.
• Optimize existing databases (MySQL) and apply necessary indexing.
About You:
• Bachelor’s degree in Computer Science, Information Systems, or a related field.
• Proven experience in data engineering, including ETL processes and data pipeline management.
• Proficiency in SQL and familiarity with NoSQL databases.
• Experience with AWS data services such as Glue, Redshift, and Athena.
• Strong understanding of data modeling, warehousing, and processing techniques.
• Experience with big data technologies like Hadoop, Spark, or Kafka.
• Knowledge of scripting languages such as Python or Bash for automating data tasks.
If this role seems like the right fit for you, please apply by sending your CV via e-mail at
hr@admin22.com