Role: ETL lead with PySpark and AWS
Location: Minneapolis, MN (Onsite)
Duration: Long Term
Job Description :
ETL Developer (5+years) to create data pipeline ETL Jobs using AWS Glue and PySpark within the financial services industry.
Responsibilities: Work with a scrum team(s) to deliver product stories according to priorities set by the business and the Product Owners. Interact with stakeholders. Provide knowledge transfer to other team members. Creating and testing pipeline jobs locally using aws glue interactive session. Performance tuning of PySpark jobs. AWS Athena to perform data analysis on Lake data populated into aws glue data catalog through aws glue crawlers. Must Haves: Responsible for designing, developing, and maintaining ETL processes to support data integration and business intelligence initiatives. Need to closely work with stakeholders to understand data requirements and ensure efficient data flow and transformation using ETL tools and PySpark Develop and implement ETL processes using with one of ETL tool and PySpark to extract, transform, and load data.4+ years of experience in ETL development with knowledge on Pyspark5+ years as an ETL DeveloperSQL expertAWS Glue WITH Python ( PySpark )PySpark Dataframe APISpark SQLKnowledge in AWS services (e.g. DMS, S3, RDS, Redshift, Step Function).Nice to Haves: Etl development experience with tools e.g. SAP BODS, Informatica. Good understanding of version control tools like Git, GitHub, TortoiseHg. Financial services experience Agile
Regards
Sagar Bhardwaj
Sr. Technical Recruiter
300 Alexander Park |Suite #200|Princeton , NJ 08540
Office: +1 7324521006 Ext238
Email: [email protected]| URL: http://www.diverselynx.com