AWS Data Engineer (Data Modelling, Financial Sector with …, Mumbai
AWS Data Engineer (Data Modelling, Financial Sector with …, Mumbai
-
Mumbai, India
-
Posted: less than a week ago
-
Save
Description
A US/Canadian based IT MNC is hiring AWS Data Engineer/ Data Modeller for one of its Banking Client. Mandatory Skills: AWS Data Engineering, Data Modelling, Glue, Airflow, Kafka, Experience working with BIAN (Banking Industry Architecture Network) Framework Location: Remote Time Overlap: Till 11 PM ISTExperience in Data Engineering, for a customer data repository project. Responsible for modeling data aligned with the BIAN architecture, translating business requirements and ensuring integration with CRM. Essential knowledge in AWS technologies: Aurora PostgreSQL, S3 Lakehouse (Iceberg), Glue, Debezium CDC, MSK.
Advanced English is essential. We value experience in the financial sector and familiarity with BIAN. Focus on data quality and collaboration with engineering is expected for efficient pipelines. Required skill: 1. Lakehouse Data Modeling on Amazon S3 oDesign Medallion architecture (Bronze/Silver/Gold) oModel data for scalability, partitioning, and domain-based access oHandle schema evolution and time-travel use cases2.AWS Glue + PySpark (ETL Modeling) oTranslate logical/physical models into PySpark transformations oOptimize joins, partition pruning, pushdown predicates oManage schema via Glue Data Catalog3. Schema Design & Metadata Management oDefine canonical schemas and data contracts oMaintain centralized metadata using Glue Catalog oVersioning and backward compatibility of schemas4. Modern Table Formats (Apache Iceberg / Delta) oImplement ACID-compliant tables on S3 oDesign for incremental loads, CDC, and snapshot-based querying oOptimize compaction and partition strategies5. Streaming & CDC Data Modeling (Kafka / MSK) oDesign event schemas aligned with domain models oModel change data capture flows into lakehouse oEnsure consistency between streaming and batch layers6. Advanced Data Modeling Techniques oData Vault 2.0 (Hubs, Links, Satellites) oDimensional modeling (Star/Snowflake) oSCD (Type 1/2/3), surrogate keys, historization7. Data Governance & Quality Engineering oData lineage, cataloging, metadata-driven pipelines oData quality frameworks (Great Expectations, Deequ) oRBAC, audit, compliance8. Lakehouse & Medallion Architecture oBronze (raw CDC), Silver (conformed), Gold (business-ready) oSchema evolution, late arriving data, deduplication9. Orchestration & Pipeline Engineering oApache Airflow (DAG design, dependency mgmt, SLA handling) oHybrid orchestration (event + schedule driven) oCI/CD for data pipelines10. Canonical & Contract-First Data Design oCanonical schemas, data contracts, schema versioning oAPI/event schema alignment (Avro/JSON/Protobuf)11. Domain-Centric Data Modeling oNice to have experience BIAN-aligned service domains (www. Bian. Org) oDomain-driven design with explicit data ownership and boundaries Apply on Kit Job: kitjob.in/job/4n5xua
Advanced English is essential. We value experience in the financial sector and familiarity with BIAN. Focus on data quality and collaboration with engineering is expected for efficient pipelines. Required skill: 1. Lakehouse Data Modeling on Amazon S3 oDesign Medallion architecture (Bronze/Silver/Gold) oModel data for scalability, partitioning, and domain-based access oHandle schema evolution and time-travel use cases2.AWS Glue + PySpark (ETL Modeling) oTranslate logical/physical models into PySpark transformations oOptimize joins, partition pruning, pushdown predicates oManage schema via Glue Data Catalog3. Schema Design & Metadata Management oDefine canonical schemas and data contracts oMaintain centralized metadata using Glue Catalog oVersioning and backward compatibility of schemas4. Modern Table Formats (Apache Iceberg / Delta) oImplement ACID-compliant tables on S3 oDesign for incremental loads, CDC, and snapshot-based querying oOptimize compaction and partition strategies5. Streaming & CDC Data Modeling (Kafka / MSK) oDesign event schemas aligned with domain models oModel change data capture flows into lakehouse oEnsure consistency between streaming and batch layers6. Advanced Data Modeling Techniques oData Vault 2.0 (Hubs, Links, Satellites) oDimensional modeling (Star/Snowflake) oSCD (Type 1/2/3), surrogate keys, historization7. Data Governance & Quality Engineering oData lineage, cataloging, metadata-driven pipelines oData quality frameworks (Great Expectations, Deequ) oRBAC, audit, compliance8. Lakehouse & Medallion Architecture oBronze (raw CDC), Silver (conformed), Gold (business-ready) oSchema evolution, late arriving data, deduplication9. Orchestration & Pipeline Engineering oApache Airflow (DAG design, dependency mgmt, SLA handling) oHybrid orchestration (event + schedule driven) oCI/CD for data pipelines10. Canonical & Contract-First Data Design oCanonical schemas, data contracts, schema versioning oAPI/event schema alignment (Avro/JSON/Protobuf)11. Domain-Centric Data Modeling oNice to have experience BIAN-aligned service domains (www. Bian. Org) oDomain-driven design with explicit data ownership and boundaries Apply on Kit Job: kitjob.in/job/4n5xua
Highlights
-
Company namePrasha Consultancy Services Private
-
Job positionAWS Data Engineer (Data Modelling, Financial Sector with BIAN Architecture) (Mumbai)
Safety Tips
If the salary for a position is far above normal, proceed with caution.
More info about this ad
AWS Data Engineer (Data Modelling, Financial Sector with … has been posted in the Dhārāvi Design & Architecture category on Locanto.
Right now, this is the only ad posted in this category in Dhārāvi.
Interested in more? Widen your search to view ads in nearby areas of Dhārāvi. This includes Design & Architecture in Chembur, Andheri East and Kurla. There are more ads within a 15 km radius for this category. If you want to view those ads, click here.