Interested in working with large data sets? Ready to put your data engineering skills to work at a modern consulting firm serving a growing life science company?
Our client, a consultancy that specializes in strategy and technology for business transformations, is embarking on an engagement with a life science company that has recently accrued new data sets to add to its existing data warehouse. They are looking for an ETL Engineer to come in and help transform and join the data sets in Amazon S3. This highly skilled individual will be well versed in Apache Spark and PySpark, have solid knowledge of JupyterHub, and have experience implementing Amazon Elastic MapReduce (EMR).
ETL Engineer Requirements:
5+ years of advanced experience using PySpark and Apache Spark
Noted success in data transformation and joining to create large data sets
Experience in data warehousing and using Amazon EMR to process and analyze data
Documented experience in building pipelines in PySpark
Exposure to working with life science data a plus
Helpful to have worked with predictive analytics and machine learning
This is a consultant-to-hire position in the Philadelphia area.