My name is Adarsh, and I am a Technical Recruiter at Empower Professionals Inc. I am reaching out about a Data Scientist with GCP background role with one of our clients based in Philadelphia, PA (Fully Onsite). Please let me know if you are available in the job market and interested in this role; if so, we can connect and speak further.
Role: Data Scientist with GCP background
Duration: 12 Months
Location: Philadelphia, PA (Fully Onsite) (Relocation is available for nearby states only)
Job Description:
We are seeking a highly skilled Data Engineer to design, build, and maintain scalable data platforms that enable large-scale ingestion, storage, processing, and analysis of structured and unstructured data. This role will focus on constructing data products (data lake / data warehouse), optimizing data pipelines, and implementing robust ETL workflows to support analytics, machine learning, and operational reporting. The ideal candidate will be proficient in distributed computing, cloud-based data architectures (GCP), and modern data processing frameworks. Experience with real-time data streaming (Kafka, Apache Beam), MLOps, and infrastructure automation (Terraform, Jenkins) is highly preferred.

Key Responsibilities:
Data Platform & Architecture Development
• Design, implement, and maintain scalable data platforms for efficient data storage, processing, and retrieval.
• Build cloud-native and distributed data systems that enable self-service analytics, real-time data processing, and AI-driven decision-making.
• Develop data models, schemas, and transformation pipelines that support evolving business needs while ensuring operational stability.
• Apply best practices in data modeling, indexing, and partitioning to optimize query performance and cost efficiency, while also following sustainability best practices.

ETL, Data Pipelines & Streaming Processing
• Build and maintain highly efficient ETL pipelines using SQL and Python to process large-scale datasets.
• Implement real-time data streaming pipelines using Kafka, Apache Beam, or equivalent technologies.
• Develop reusable internal data processing tools to streamline operations and empower teams across the organization.
• Write advanced SQL queries for extracting, transforming, and loading (ETL) data with a focus on execution efficiency.
• Ensure data validation, quality monitoring, and governance using automated processes and dashboards.

MLOps & Cloud-Based Data Infrastructure
• Deploy machine learning pipelines with MLOps best practices to support AI and predictive analytics applications.
• Optimize data pipelines for ML models, ensuring seamless integration between data engineering and machine learning workflows.
• Work with cloud platforms (GCP) to manage data storage, processing, and security.
• Utilize Terraform, Jenkins, and other CI/CD tools to automate data pipeline deployments and infrastructure management.

Collaboration & Agile Development
• Work in Agile/DevOps teams, collaborating closely with data scientists, software engineers, and business stakeholders.
• Advocate for data-driven decision-making, educating teams on best practices in data architecture and engineering.
Required Skills & Qualifications
• 5+ years of experience as a Data Engineer working with large-scale data processing.
• Strong proficiency in SQL for data transformation, optimization, and analytics.
• Expertise in programming languages (Python, Java, Scala, or Go) with an understanding of functional and object-oriented programming paradigms.
• Experience with distributed computing frameworks.
• Proficiency in cloud-based data engineering on AWS, GCP, or Azure.
• Strong knowledge of data modeling, data governance, and schema design.
• Experience with CI/CD tools (Jenkins, Terraform) for infrastructure automation.

Preferred Qualifications
• Experience with real-time data streaming (Kafka, or equivalent).
• Strong understanding of MLOps and integrating data engineering with ML pipelines.
• Familiarity with knowledge graphs and GraphQL APIs for data relationships.
• Background in retail, customer classification, and personalization systems.
• Knowledge of business intelligence tools and visualization platforms.
• Databricks Data Intelligence Platform, DevOps, ETL, Google Cloud Platform, IT operations, Python, business intelligence, cloud computing, cloud providers, data analysis, data intelligence, data mining, data processing, data science, information technology, public cloud, system administration, technology
Awaiting your quick response. Thanks!
P.S. Empower is a top vendor to clients such as Apex Systems LLC, Sogeti, Randstad, CapGemini, UST and more.
Thanks
Adarsh Sharma
Technical Recruiter | Empower Professionals
Adarsh@empowerprofessionals.com
100 Franklin Square Drive – Suite 104 | Somerset, NJ 08873
www.empowerprofessionals.com
Certified NJ and NY Minority Business Enterprise (NMSDC)
Note: We respect your online privacy. This is not an unsolicited mail. Under Bill S.1618 Title III, passed by the 105th U.S. Congress, this mail cannot be considered spam as long as we include contact information and a method to be removed from our mailing list. If you are not interested in receiving our e-mails, please reply with "REMOVE" in the subject line and list all the e-mail addresses to be removed, including any addresses that might be forwarding our e-mails to you. We are sorry for the inconvenience.
This e-mail and any files transmitted with it are for the sole use of the intended recipient(s) and may contain confidential and privileged information. If you are not the intended recipient(s), please reply to the sender and destroy all copies of the original message. Any unauthorized review, use, disclosure, dissemination, forwarding, printing, or copying of this e-mail, and/or any action taken in reliance on its contents, is strictly prohibited and may be unlawful.
To subscribe or unsubscribe: https://send.empowerprofessionals.com/newsletter/subscribe/647186e8-bcb0-4f73-8f80-cb3daff9ad90