Offer summary

Qualifications:

4+ years of experience in data engineering and large-scale data processing pipelines., Proficiency in distributed storage and processing platforms like HDFS, Spark, and Hadoop., Solid understanding of data modeling, machine learning, and AI concepts., Fluency in programming languages such as Node.js, Java, or Python, with a relevant degree in engineering..

Key responsibilities:

Understand business requirements to align with data storage and computing technologies.

Create and deploy complex data processing pipelines in production environments.

Design scalable implementations of data models developed by Data Scientists.

Maintain documentation on data models and troubleshoot data quality issues.

Job description

Ascendeum is looking for veterans with extensive hands-on experience in the field of data engineering to build cutting-edge solutions for large-scale data extraction, processing, storage, and retrieval.

About Us:

We provide AdTech strategy consulting to leading Internet websites and apps globally hosting over 200 million monthly worldwide audiences. Since 2015, our team of consultants and engineers have been consistently delivering intelligent solutions that enable enterprise-level websites and apps to maximize their digital advertising returns.

Job Responsibilities:

Understand long-term and short-term business requirements to precisely match them with the capabilities of different distributed storage and computing technologies from the plethora of options available in the ecosystem.
Create complex data processing pipelines.
Design scalable implementations of the models developed by our Data Scientists.
Deploy data pipelines in production systems based on CICD practices.
Create and maintain clear documentation on data models/schemas as well as transformation/validation rules.
Troubleshoot and remediate data quality issues raised by pipeline alerts or downstream consumers.

Desired Skills and Experience:

4+ years of overall industry experience building and deploying large scale data processing pipelines in a production environment
Experience building data pipelines and data centric applications using distributed storage platforms such as HDFS, S3, NoSql databases (Hbase, Cassandra, etc); and distributed processing platforms such as Hadoop, Spark, Hive, Oozie, Airflow, etc.
Hands on experience with MapR, Cloudera, Hortonworks, and/or Cloud (AWS EMR, Azure HDInsights, Qubole, etc.) based Hadoop distributions
Practical experience working with well know data engineering tools and platforms Kafka, Spark, Hadoop
Solid understanding of Data Modelling, ML and AI concepts
Fluent in programming languages like Nodejs/Java/Python
Education: B.E / B Tech /M tech / MS.

Thank you for your interest in joining Ascendeum.

Required profile