Shire Jobs

Job Information

Aetna Resources, LLC: Lead Data Engineer in Alpharetta, Georgia

Aetna Resources LLC, a CVS Health company, is hiring for the following role in Alpharetta, GA: Lead Data Engineer to design, build, and manage large-scale data structures, pipelines, and efficient Extract/Transform/Load (ETL) workflows to support business applications. Duties include: develop large-scale data structures and pipelines to organize, collect, and standardize data to generate insights and address reporting needs; write ETL processes, design database systems, and develop tools for real-time and offline analytic processing; collaborate with the Data Science team to transform data and integrate algorithms and models into automated processes; leverage knowledge of Hadoop architecture, HDFS commands, and query design and optimization to build data pipelines; utilize programming skills in Python, Java, or similar languages to build robust data pipelines and dynamic systems; build data marts and data models to support Data Science and other internal customers; integrate data from a variety of sources and ensure adherence to data quality and accessibility standards; analyze current information technology environments to identify and assess critical capabilities and recommend solutions; and experiment with available tools and advise on new tools to provide optimal solutions that meet the requirements dictated by the model or use case. Telecommuting available. Multiple positions.

Requirements: Bachelor's degree (or foreign equivalent) in Computer Engineering, Data Engineering, Electronic Engineering, Engineering, Data Science, or a related field, and five (5) years of experience in the job offered or a related occupation. Requires three (3) years of experience with each of the following: healthcare data analytics, including industry-standard formats (X12, HL7, XML, or flat files) and clinical data; Jenkins and Git for CI/CD pipeline automation; designing and optimizing queries against data in an HDFS environment; Hive, Python, PySpark, and Hadoop; traditional RDBMS (Teradata, Oracle, or DB2); SAS (Statistical Analysis System), including Datasets, Macro Facility, Formats and Informats, Functions, and Procedures; designing data models for analytical and reporting use cases; and providing domain support for a healthcare or retail pharmacy organization.
