Match score not available

Sr. Data Engineer

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Bachelor's degree in Computer Science or similar from a top tier school., 7+ years of experience with Azure DevOps and Azure Cloud Platform., Hands-on experience with Python, Pyspark, and building data pipelines., Strong technical, analytical, and interpersonal skills..

Key responsabilities:

  • Meet with stakeholders to understand project goals and requirements.
  • Recommend architecture and ETL design patterns aligned with organizational objectives.
  • Collaborate with developers to ensure product deliverables are met.
  • Participate in design discussions and develop proof of concepts for big data applications.

Fusemachines logo
Fusemachines SME https://bit.ly/
201 - 500 Employees
See all jobs

Job description

About Fusemachines
Fusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from underserved communities. With a robust presence in four countries and a dedicated team of over 400 full-time employees, we are committed to fostering AI transformation journeys for businesses worldwide. At Fusemachines, we not only bridge the gap between AI advancement and its global impact but also strive to deliver the most advanced technology solutions to the world.

This is full-time role.

About the role:

This is a full-time position responsible for leading, designing, building, and maintaining the infrastructure required for data integration, storage, processing, and analytics (BI, visualization and Advanced Analytics) using Microsoft Azure in the Media domain.

We are seeking a Senior Data Engineer with hands-on Python, and Pyspark experience and proven abilities to support software development activities in an Agile software development lifecycle.

We are seeking a well-rounded architect for a cloud based big data application using a variety of technologies. The ideal candidate will possess strong technical, analytical, and interpersonal skills. In addition, the candidate will collaborate with developers on the team to achieve architecture and design objectives as agreed with stakeholders.

Qualification & Experience

  • Must have a full-time Bachelor's degree in Computer Science or similar from a top tier school.
  • 7+ years of experience with Azure DevOps, Azure Cloud Platform, or other hyperscalers.
  • At least 7 years of experience as a data engineer with strong expertise in Azure, working on generation of big datasets using different data sources, in the Media industry.
  • Proven experience delivering projects and products for Data and Analytics as a data engineer.

Required skills/Competencies

  • Hands-on 7 yrs experience Python and Pyspark, Jupyter Notebooks, Python.
  • Familiarity with Databricks. Azure Databricks is a plus.
  • Familiarity with data cleansing, transformation, and validation.
  • Proven architecture skills on Big Data projects.
  • Hands-on experience with a code versioning tool such as GitHub, Bitbucket, etc.
  • Hands-on experience building pipelines in GitHub (or Azure Devops, Github, Jenkins, etc.)
  • Hands-on 3yrs experience with Spark.
  • Strong written and verbal communication skills.
  • Self-motivated and ability to work well in a team.

Responsibilities

  • Meet with stakeholders to understand the big picture and asks.
  • Recommend architecture aligned with the goals and objectives of the product/organization.
  • Recommend standard ETL design patterns and best practice.
  • Participate in the detail design and architectural discussions as well as customer requirements sessions to support the implementation of code and procedures for our big data product.
  • Design and develop proof of concept/prototype to demonstrate architecture feasibility.
  • Collaborate with developers on the team to meet product deliverables.
  • Must have familiarity with data science algorithms/tech stack. Any one of the languages: SAS, SPSS code or R-code.
  • Work independently and collaboratively on a multi-disciplined project team in an Agile development environment.
  • Ability to identify and solve for code/design optimization.
  • Learn and integrate with a variety of systems, APIs, and platforms.
  • Interact with a multi-disciplined team to clarify, analyze, and assess requirements.
  • Be actively involved in the design, development, and testing activities in big data applications.

Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.

Required profile

Experience

Spoken language(s):
Pashto
Check out the description to know which languages are mandatory.

Other Skills

  • Self-Motivation
  • Teamwork
  • Communication

Data Engineer Related jobs