Job Title: Data Engineer
Duration: 6+ Months Contract
Location: Remote India
Duties And Responsibilities
Essential Functions:
· Create and maintain optimal data pipeline architecture.
· Assemble large, complex data sets that meet functional / non-functional business requirements.
· Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
· Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies.
· Build analytics tools that utilize the data pipeline to provide actionable insights for Information Asset as well as Information Asset Customers’ operational efficiency and other key business performance metrics.
· Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.
· Create data tools for analytics to build and optimize offering into an industry leading solution
· Work with data and analytics experts to strive for greater functionality data systems.
Position Requirements
· Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
· Experience building and optimizing ‘big data’ data pipelines, architectures and data sets.
· Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
· Strong analytic skills related to working with unstructured datasets.
· Build processes supporting data transformation, data structures, metadata, dependency and workload management.
· A successful history of manipulating, processing and extracting value from large, disconnected datasets.
· Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
· Strong project management and organizational skills.
· Excellent writing and verbal communication skills and a high level of customer orientation
Working Environment/Travel Requirements
This is a remote working position and you will be expected to work out of a home office with high speed internet and phone connection to enable seamless virtual presence.
Qualifications
Required Education and Experience:
· Bachelor’s degree in computer science or information technology, or equivalent work experience.
· 3+ years as a Data Engineer.
· Data engineering certification (e.g. IBM Certified Data Engineer, AWS Certified Data Analytics - Specialty) is a plus.
· Must have technical expertise with data models, data mining, and segmentation techniques.
· Experience using any of all of a combination of the following software/tools: • Relational SQL and NoSQL databases, including Postgres and Cassandra.
· Data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
· Data Storage Technologies like Amazon S3, Snowflake/Redshift, Hive, HDFS
· Stream-processing systems: Storm, Spark-Streaming, etc.
· Object-oriented/object function scripting languages: Python/ pySpark, Java, C++, Scala, etc.
· Data warehousing using Oracle/Snowflake.
· ETL Technologies like -Apache Spark/AWS Glue/ Databricks, Apache Airflow, Apache Hadoop.
· Experience supporting and working with cross-functional teams in a dynamic environment.
Regards,
Nick Arthur (Nizam)
Associate Director, Recruitment
Pull Skill Technologies India.
+91 7416706136
LinkedIn: linkedin.com/in/nick-arthur-83937b62