import pandas as pd from azure.data.factory import * def build_pipeline(source, sink): df = pd.read_sql(query, conn) df = df.dropna(subset=['id']) df['quality_score'] = validate(df) return transform(df) class DataGovernance: def __init__(self, config): self.rules = load_rules(config) def profile(self, dataset): return {col: stats(dataset[col]) for col in dataset} SELECT d.*, q.score FROM datasets d JOIN quality_checks q ON d.id = q.dataset_id WHERE q.score > 0.95 ORDER BY d.created_at DESC; pipeline = DataPipeline( extract=SQLSource(conn), transform=[Clean(), Enrich(), Validate()], load=AzureSink(container) ) pipeline.run(schedule='daily')
Roham Chlak

// Hello World, I'm

Roham Chlak

رُهام شلاق

Data professional with hands-on experience in data engineering, data quality, governance, and analytics. Skilled in building scalable ETL pipelines, profiling datasets, and delivering actionable insights across enterprise organizations in the UAE. Passionate about turning messy data into clean, reliable, governed assets.

💼

work_experience

// career.log

Data Engineer - Manager

💼 Capgemini Invent
📅 Jun 2025 — Present📍 Dubai, United Arab Emirates

Leading data engineering initiatives and managing end-to-end data pipelines for enterprise clients. Driving cloud-based data platform modernization and implementing best practices for data architecture and governance across multiple engagement streams.

Data Quality Analyst

💼 General Pension and Social Security Authority (GPSSA)
📅 Jun 2023 — May 2025📍 Abu Dhabi, United Arab Emirates

Developed and published Open Data datasets along with comprehensive metadata and data dictionaries to support TDRA annual assessment. Conducted data profiling and identified data quality issues across multiple data sources. Designed dashboards and visualizations in SAS, and delivered daily and ad-hoc reporting to stakeholders.

Data Analyst

💼 Injazat Data Systems (a G42 Company)
📅 Jul 2022 — Jun 2023📍 Abu Dhabi, United Arab Emirates

Operated within the Data Office function, contributing to data governance, data management, and enterprise analytics. Performed data profiling, cleansing, and enrichment of enterprise datasets. Delivered analytical reports and dashboards for senior leadership decision-making.

Junior Data Analyst

💼 Genie AI Solutions
📅 Jan 2021 — Aug 2021📍 Beirut, Lebanon

Supported the analytics team in data extraction, transformation, and reporting. Developed scripts and queries for data processing using Python and SQL. Assisted in building automated data pipelines and creating visualization dashboards.

IT Intern

💼 Lebanon Opportunities
📅 Jun 2019 — Sep 2019📍 Beirut, Lebanon

Provided technical support and contributed to database management and internal reporting. Gained exposure to real-world IT infrastructure and data handling procedures.

Data Scientist & Developer

💼 Internal Security Forces (ISF)
📅 2013 — 2022📍 Lebanon

Python coding / Scripting / Machine learning algorithms / Spark (PySpark) / Hive (on top of Hadoop) / Elastic Search, Kibana / Siren (Graph) / NiFi (Data Transformation and Processing and also used as ETL).

Engineering Design & Manufacturing Developer

💼 Engineering Design & Manufacturing (EDM)
📅 2012 — 2013📍 Lebanon

Delphi Forms / Pascal Programming Language - working on their own system (EDM) for business solutions (P.O.S system, stock system, accounting system).

Developer

💼 Allied Computer Incorporation (ACI)
📅 2010 — 2011📍 Lebanon

Delphi Forms / Pascal Programming Language - Working on ARAM Accounting system (Saudi version).

Designer & Publisher

💼 Lebanese Preparatory School (LPS)
📅 2009 — 2009📍 Lebanon

Designing school agendas using Adobe Illustrator. Publication of computer knowledge books for grades 1 to 12 after they have been translated from English to French.

📚

education

// degrees.json

Master of Science in Data Analytics

University of Strathclyde

📅 2021 — 2022📍 Glasgow, United Kingdom

Specialized in data analytics, machine learning, and statistical modeling. Completed a dissertation focused on advanced data quality assessment methodologies.

Bachelor of Computer Science

Lebanese International University

📅 2016 — 2020📍 Beirut, Lebanon

Focused on software engineering, databases, and information systems. Built a strong foundation in programming, algorithms, and data structures.

services

// what_i_do.ts
🗄

Data Engineering

Python Scripting and PySpark for data processing and cleansing from different sources (CSV files, Excel, Access, MySQL, SQL Server, Oracle Database, etc.) to a big data ecosystem and visualization (Hadoop — HDFS file system — and ElasticSearch).

💻

Web Development

Back-end and Front-end with C# MVC 5 on Visual Studio with SQL Server database. Django web development. WordPress. Angular front-end applications. PHP & MySQL solutions.

🧠

Data Science / AI

Building Machine Learning and Deep Learning models with supervised data. Training and testing models for best accuracy. Statistical analysis using R and Python. NLP and predictive analytics.

🖥

Windows Applications

Java development for Windows OS applications. Delphi Forms / Pascal programming for business solutions including POS systems, stock management, and accounting systems.

👥

Training

Python course training for beginners and intermediates. Training on data processing, PySpark for big data processing, and Hadoop ecosystem. Big data architecture and pipeline design workshops.

🏆

trainings

// certifications.log

Ministry of Administrative Reform (OMSAR)

📍 Lebanon
📅 2020

Cybersecurity awareness training — keeping abreast of the latest threat intelligence and attack methods to mitigate cybersecurity uncertainty, eliminate risky behaviour, and install company-wide security best practices.

Softech

📍 Beirut, Lebanon
📅 2020

Using OSINT tools and Softech OSINT applications (PeopleMon, OSIntMon) for open-source intelligence gathering and analysis.

Cognitus SAS

📍 Dubai, UAE
📅 2019

NiFi for Data flow / Data transformation / Data Processing. Python Coding / Scripting for Data Processing. Design architecture and lead development of a distributed, scalable, and fully extensible smart solution for Big Data Management.

Cognitus SAS

📍 Versailles, France
📅 2018

Enhance the speed and efficiency of legacy solutions by analysing requirements, designing, and building big data architectures and data processing pipelines using SQL, Apache Spark (PySpark), Hive, and ElasticSearch — Kibana. Machine Learning / Deep Learning Algorithm implementation.

Cognitus SAS

📍 Paris, France
📅 2018

Statistical studies on Data using the R language on R Studio.

Cognitus SAS

📍 Beirut, Lebanon
📅 2018

Support the adoption of Big Data solutions by providing advice on the design and deployment of big data ecosystems for storage, streaming, processing, and visualisation (Hadoop ecosystem).

New Horizons

📍 Lebanon
📅 2017

Oracle 12c / EXADATA Storage database administration and management.

tech_stack

// skills.config
Programming
Python Java C# MVC5 PHP Angular Delphi / Pascal R SQL Pandas NumPy
Big Data
PySpark Apache Spark Hadoop / HDFS Hive NiFi SSIS ElasticSearch Kibana Siren Graph
Database
Oracle Database Oracle 12c / EXADATA Oracle Forms & Reports MySQL MS SQL Server PostgreSQL
Analytics
SAS Statistical Analysis
Visualization
Power BI Tableau SAS Visual Analytics
Cloud
Azure Data Factory Azure Synapse Azure Databricks AWS Glue
Domain
Data Governance Data Management Open Data Standards
Engineering
ETL Pipelines Data Migration Data Warehousing
Quality
Data Profiling Data Cleansing
Data Science
Machine Learning Deep Learning NLP
Web Development
Django WordPress
Tools
Adobe Illustrator OSINT Tools R Studio