Data Engineer (CDC & Legal ETL)

Location
D01 Cecil, Marina, People’s Park, Raffles Place
Job Type
Full-time
Experience
Mid
Category
General
Salary
$4,000 - $4,500
Posted
3 weeks ago
Expires
May 20, 2026
Views
6

Job Details

Vacancies

1 position

Experience Required

No experience required

Job Description

Role Overview

As our Data Engineer, you will own the lifeblood of our legal AI platform — the data pipeline that keeps our knowledge graph accurate and current. You will build CDC (Change Data Capture) pipelines to synchronize our Neo4j knowledge graph with Singapore’s official legal publications, implement data quality validation frameworks, and ensure version control across statutory amendments. When a new Gazette is published, your pipeline is what makes our AI know about it within days — not months.

Key Responsibilities

  • Design and implement CDC pipelines to capture incremental changes from Singapore Gazette, AGC consolidated statutes, and other authoritative legal sources

  • Build automated data ingestion workflows: scraping, parsing, structural analysis of legal documents (Act → Part → Division → Section → Subsection)

  • Implement temporal metadata extraction: effective dates, repeal dates, amendment lineage, and version tracking for every statutory provision

  • Develop data quality validation framework: automated checks for temporal conflicts, missing citations, entity mismatches, and cross-jurisdiction inconsistencies

  • Manage Neo4j graph data loading and incremental updates — updating specific nodes without full graph rebuilds

  • Build monitoring dashboards for data pipeline health: ingestion latency, error rates, coverage metrics

  • Implement data versioning and rollback capabilities for audit compliance

  • Collaborate with Backend Engineer on ETL-to-KAG integration and with QA on data accuracy validation

Requirements

  • 2+ years experience in data engineering, ETL development, or data pipeline architecture

  • Proficiency in Python and/or Go for data processing scripts and pipeline orchestration

  • Experience with graph databases (Neo4j preferred) or relational databases (PostgreSQL)

  • Hands-on experience with CDC tools or patterns (Debezium, Kafka Connect, or custom CDC)

  • Understanding of data quality frameworks and validation methodologies

  • Familiarity with AWS data services (S3, Glue, Lambda, or Step Functions)

  • Proficiency in English; Mandarin is a strong plus

  • Singapore Citizen or Permanent Resident (PR) required

Nice-to-Have

  • Experience with web scraping and document parsing (BeautifulSoup, Scrapy, or similar)

  • Background in legal data, regulatory data, or structured document processing

  • Experience with workflow orchestration tools (Airflow, Dagster, Prefect)

  • Knowledge of NLP-based entity extraction or named entity recognition

Similar Jobs

ALTIUS ORG

🤡Client Engagement Crew [Mentorship + Travel]

ALTIUS ORG Islandwide 14 hours ago
EMINENCE ORGANIZATION PTE. LTD.

[🌠ENTRY LEVEL🌠] CAMPAIGN SPECIALIST

EMINENCE ORGANIZATION PTE. LTD. D01 Cecil, Marina, People’s Park, Raffles Place 14 hours ago

Security Detection & SIEM Engineer

LUMINA ADVISORY & GLOBAL SEARCH PTE. LTD. D01 Cecil, Marina, People’s Park, Raffles Place 14 hours ago

Sales Manager

ASIA SEARCH PTE. LTD. D01 Cecil, Marina, People’s Park, Raffles Place 14 hours ago
SIMPLE RECRUIT

EVENTS & MARKETING (1-1 Mentorship)

SIMPLE RECRUIT D01 Cecil, Marina, People’s Park, Raffles Place 14 hours ago

Response Reality Check

Quality: 95%
Response N/A
Company Stats
Response metrics N/A
Platform Spread
mycareersfuture
95%
Quality Score
N/A
Response Rate

LAWGORITHM PRIVATE LIMITED

Ready to Apply?

This is a direct application to LAWGORITHM PRIVATE LIMITED. No recruitment agencies involved.

Apply for this Position

Response rate not available - Direct application to employer