Skip to main content

Projects

Real-time Fraud Detection Architecture

Financial Fraud Detection System

Designed and deployed a distributed event processing system handling 500K transactions/sec.

  • Clickable architecture diagram (Kafka→Flink→Redshift)
  • Performance comparison slider
  • Simulated transaction stream demo
GenAI Document Processing

GenAI Contract Analyzer

Engineered an LLM-powered document processing system automating key term extraction.

  • Upload sample PDF for key term extraction
  • Accuracy heatmap
  • Cost savings calculator
Clinical Data Pipeline

Clinical Data Modernization

Led migration of 22 legacy SAS pipelines to Delta Lake on Databricks.

  • FDA submission timeline reducer
  • Interactive data lineage explorer
  • PHI redaction demo

Technical Skills

Cloud technology

Cloud Expertise

Interactive Skill Explorer

Cloud Platforms:

  • AWS (Kinesis, Glue, Redshift, Lambda, EMR)
  • Azure (Databricks, Synapse, Purview)
  • GCP (BigQuery, Dataflow)

Big Data Stack:

  • Spark (PySpark/Scala)
  • Kafka, Flink, Airflow, dbt
  • Snowflake, Iceberg, Delta Lake, Hudi

Data Architecture:

  • Data Mesh, Medallion Architecture
  • CDC (Debezium), DWH Modeling (Data Vault 2.0)

ML/GenAI:

  • LLM Orchestration (LangChain, LlamaIndex)
  • Vector DBs (Pinecone), Feature Stores (Feast)

DevOps:

  • Terraform, Kubernetes
  • CI/CD (GitHub Actions, Jenkins), Docker, ArgoCD

Certification Badges:

AWS Certified Azure Certified Databricks Certified

Contact

Book a Consultation

Schedule a 15-min consultation to discuss your data engineering needs.

Secure Messaging

Social Proof

See what others say about my work.