Senior Azure Data Engineer

Website MBVA 24-7

Client: Alumus
Job Title: Senior Azure Data Engineer
Headcount: 1
Basic Hourly Rate: $7.50
Job Type: Full-Time
Work Schedule: Monday – Friday, 8:30 am to 5:00 pm AZ Time

FILIPINO APPLICANTS ONLY

Job Responsibilities:

  1. Azure-Native Data Engineering
  • Design, build, and maintain production-grade data pipelines on the Microsoft Azure platform using Azure Data Factory (ADF), Azure Databricks, Azure Synapse Analytics, and related services.
  • Architect and manage Azure Data Lake Storage (ADLS Gen2) and Azure SQL Data Warehouse environments with a focus on scalability, performance, and cost efficiency.
  • Implement CI/CD pipelines for data infrastructure using Azure DevOps and GitHub Actions.
  • Ensure environments follow Azure best practices for security, governance, and compliance—including RBAC, private endpoints, and encryption at rest and in transit.
  1. API Integration & Web Application Data Flows
  • Design, implement, and maintain robust integrations with RESTful and SOAP APIs, including authentication handling (OAuth 2.0, API keys, JWT), rate-limiting strategies, and error recovery.
  • Build and manage data ingestion pipelines that consume data from internal and third-party web applications, including real-time and event-driven flows using Azure Event Hubs or Azure Service Bus.
  • Develop and maintain Python- or .NET-based microservices and Azure Functions to support lightweight data workflows and API wrappers.
  • Monitor API health, data contracts, and schema evolution to proactively prevent pipeline failures downstream.
  1. Data Pipelines, Warehousing & Cleanup
  • Own the full lifecycle of ETL/ELT pipelines: ingestion, cleansing, validation, deduplication, transformation, and loading into analytical targets.
  • Implement sophisticated data matching and entity resolution logic to reconcile patient, provider, and facility records across disparate healthcare systems.
  • Design and enforce data quality frameworks—including anomaly detection, completeness checks, and lineage tracking—to ensure trustworthy data at every layer.
  • Optimize query performance and storage costs across Azure Synapse, Azure SQL, and Databricks Delta Lake environments.
  1. Data Modelling & Warehouse Design
  • Design scalable, maintainable data models (dimensional, relational, and lakehouse patterns) that serve both operational reporting and long-term analytical needs.
  • Build and manage transformation logic using dbt (Data Build Tool) with full test coverage, documentation, and version control.
  • Maintain a clear separation between raw, staging, and curated data layers to support auditability and iterative analytics development.
  1. Analytics, Reporting & Data Science Enablement
  • Partner with analysts, data scientists, and clinical stakeholders to translate business and clinical questions into robust data products.
  • Build reusable, well-documented datasets and semantic layers in Power BI or Azure Analysis Services that support self-service analytics.
  • Apply a data science mindset to pipeline design—understanding how downstream ML models, statistical analyses, or predictive tools will consume and depend on your data.
  • Contribute to long-term analytics roadmaps: anticipating future reporting needs, designing for extensibility, and proactively surfacing data insights to leadership.
  1. Healthcare Data Expertise
  • Handle sensitive Protected Health Information (PHI) in compliance with HIPAA and relevant data privacy regulations.
  • Integrate with Electronic Medical Record (EMR) systems, including HL7 FHIR APIs, HL7 v2 message formats, and proprietary vendor exports.
  • Apply domain knowledge (where applicable) to identify data quality issues specific to clinical workflows, coding standards (ICD-10, CPT, SNOMED), and insurance/claims data.
  1. Governance, Security & Compliance
  • Enforce data governance standards: cataloging assets in Azure Purview, managing lineage, and maintaining a business glossary.
  • Implement and audit access controls, data masking, and audit trails for all sensitive data environments.
  • Participate in security reviews and ensure pipelines meet organizational and regulatory compliance requirements.

Job Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, Data Engineering, Data Science, or a related field.
  • Equivalent practical experience with a demonstrable portfolio of work will be equally considered.
  • 5+ years of experience in a data engineering, analytics engineering, or cloud data architecture role.
  • Demonstrable, hands-on experience with the Microsoft Azure data stack (ADF, Synapse, Databricks, ADLS, and Azure SQL)—this is a core requirement.
  • Proven track record building and maintaining production API integrations and data flows from web applications.
  • Experience with end-to-end pipeline development: ingestion, transformation, warehousing, and BI layer delivery.
  • Healthcare or medical data experience is highly advantageous but not required.
  • Azure Data Factory, Azure Databricks (PySpark), Azure Synapse Analytics, ADLS Gen2, Azure Functions, Azure Event Hubs / Service Bus, Azure DevOps: Azure Platform
  • Dimensional modelling, Delta Lake / Lakehouse architecture, dbt, Azure Synapse or Azure SQL: Data warehousing & modeling
  • REST and SOAP API design and consumption, OAuth 2.0, JSON/XML parsing, Webhook and event-driven data patterns: API integration
  • Python (pandas, PySpark, requests, SQLAlchemy) – advanced proficiency required; T-SQL and Spark SQL: Programming Languages
  • Power BI (including DAX, Power Query, semantic modeling) and Azure Analysis Services: BI & Reporting
  • Data validation frameworks, Azure Purview, data lineage, anomaly detection: Data Quality & Governance
  • Git, Azure DevOps pipelines, CI/CD for data infrastructure, Infrastructure as Code (Bicep or Terraform a plus): DevOps & Version Control

Hardware and Software Requirements:

Hardware:

  • AMD Ryzen 3 3200G APU or Intel Core i5 (at least 7th gen) Intel CPU
  • 16GB DDR4 RAM and 120 – 480gb SSD
  • Jabra Biz 1100 / Logitech H390 or similar Noise-canceling Headset
  • 1 – 2 Monitors (at least 21in)
  • 1080p HD Webcam
  • Internet Speed: 50mbps (Fiber/DSL) LTE not accepted

Software:

  • Genuine Windows 10 Licensed Computer
  • Microsoft Office Suite