Hi,
My name is Hamid Raza and I am a Staffing Specialist at Sonitalent LLC. I am reaching out to you about an exciting job opportunity with one of our clients.
Position: Sr. Domain Architect
Duration: 6 months+
Location: Remote
Visa: USC, GC only
Interview: 1 video interview via Teams with the Hiring Manager
Must Haves: (see Description below)
Key Responsibilities:
We need an Architect & Engineer to help us in the following areas:
- Design Security Architecture
  - Define data security measures to ensure least-privileged access.
  - Manage data access migration from Azure AD groups and SQL accounts to Unity Catalog.
  - Define roles and responsibilities for managing security at metastore, catalog, and schema levels.
  - Manage access to Databricks features.
  - Document security architecture (Databricks Security Architecture.docx).
- Revise Reporting Architecture
  - Document interactions between Tableau, Power BI, and data stored in Delta Lake.
  - Organize and allocate Databricks compute resources.
  - Apply security measures to reporting architecture.
- Revise Unity Catalog Architecture
  - Document design for storing, accessing, and sharing data assets with Databricks Unity Catalog.
  - Plan migration strategy and changes to archiving, backups, and data policies.
  - Optimize storage and manage sandboxes for different departments/projects.
  - Document Unity Catalog design (Unity Catalog Design.docx).
- Revise EDP Architecture
  - Update system architecture post-removal of Azure Synapse dedicated SQL pool.
  - Define user access methods and schema change notifications.
  - Ensure compliance with Sentara data security policies.
  - Improve data update/read processes in the ODS.
  - Address Digital Solutions’ need for FHIR-spec’d data feed.
- Design Extract Architecture
  - Document tools, methods, and security for data extraction from EDP.
  - Evaluate compatibility of existing File Extract feature with Databricks.
- Architect Real-Time Ingestion Capabilities
  - Support real-time data ingestion using Azure Event Grid.
- Architect Real-Time Data Mirroring/Transactional Data Engine
  - Serve data from Delta Lake to other transactional systems.
  - Explore SQL IaaS instance and REST API workloads for read/write/commit/rollback operations.
- Evaluate Data Catalog Architecture
  - Assess Unity Catalog features compared to Purview.
  - Determine integration needs with Purview.
- Design Support for Multiple Test Environments
  - Design data storage and access for multiple test environments.
  - Define naming conventions and access mechanisms for different environments.
  - Configure tools like Power BI and SSIS for environment switching.
  - Develop ingestion approach for loading data into Delta Lake and Synapse.
  - Manage code across all environments/schemas.
- Design Deployment Architecture
  - Define components and processes for provisioning, securing, allocating, monitoring, and scaling Databricks resources.
  - Monitor storage account bandwidth and understand limitations.
  - Document deployment architecture.
- Update Testing Strategy for Ingestion
  - Review and improve ETL testing approaches.
  - Enhance testing for ingestion, datamart pipeline, datamart ETL, and file extracts.
- Design De-Identification Strategy
  - Improve de-identification processes to meet requirements.
  - Ensure key identifiers persist across multiple sources during de-identification.
  - Apply de-identification at the time of ingestion.
Requirements:
· 15+ years of industry experience building and supporting large-scale distributed systems
· 12+ years of expertise in Databricks and Python (10+ years)
· 10+ years of expertise in Azure, working with services such as Azure Data Factory and Azure Functions
· BS/MS/PhD in Computer Science or a related major
· Great communication skills
Thanks,
Hamid Raza
Sonitalent Corp || https://www.sonitalentcorp.com/
5404 Merribrook Lane, Prospect, KY, USA