Key job responsibilities
- Create and maintain datasets in data lake and internal data management systems using S3/Glue.
A day in the life
You will use a large-scale compute platform to build big datasets used in distributed systems for machine learning and statistical analysis.
Our Data Engineer needs to be able to gather and understand data requirements, and to build and maintain big data sources that prepare data for machine learning models.
- 3+ years of data engineering experience
- Experience with data modeling, warehousing and building ETL pipelines
- Experience with SQL
- Experience with one or more query languages (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
- Experience with one or more scripting languages (e.g., Python, KornShell)
- Experience with big data technologies such as Hadoop, Hive, Spark, EMR
- Experience with any ETL tool such as Informatica, ODI, SSIS, BODI, Datastage, etc.