Hi, I'm Devvrat Rajesh Mungekar

Data Scientist | Data Engineer | AI Engineer

Engineer by structure, Scientist by curiosity, and AI innovator by passion - I design systems that think and scale.

Data Sources → ETL Process → Data Warehouse → Analytics

About Me

I'm a data-driven professional at the intersection of engineering, science, and AI — building scalable pipelines, crafting intelligent models, and deploying real-time solutions in the cloud.

With a strong foundation in modern data architectures and machine learning, I specialize in designing end-to-end systems that transform raw data into actionable intelligence. My expertise spans cloud platforms, big data processing, and AI integration — enabling organizations to harness data for smarter decisions and intelligent automation.

2+ Years Experience

10+ Data Pipelines Built

10+ Technologies Mastered

Devvrat Rajesh Mungekar

Data Scientist at LTIMindtree Canada

Technical Skills

Data Storage & Warehousing

Azure SQL
Azure PostgreSQL
Azure Data Lake Gen 2
Azure Synapse Analytics
MongoDB (NoSQL)

Programming & ETL

Python
PySpark
SQL
KQL
Shell Scripting/Bash
Apache Airflow
dbt

Cloud & Big Data

Microsoft Fabric
Azure Data Factory
Azure Databricks
Azure AI Foundry
Azure Machine Learning
Azure OpenAI
Azure AI Services (Speech, Vision, Language)
Docker

Analytics & Visualization

Tableau
Streamlit
Plotly
Geospatial Clustering
Pandas
NumPy

Professional Experience

Data Scientist

LTIMindtree Canada 2024 - Present
  • Designed and optimized end-to-end data pipelines using Azure Data Factory, Azure Databricks, Synapse Analytics, and ADLS Gen 2, improving throughput by 60%
  • Deployed LLMs through Azure OpenAI and Hugging Face, including Llama models, to support audience segmentation and prediction use cases, increasing model accuracy by 20%
  • Built and managed scalable, production-grade ETL pipelines using Python and PySpark across structured and unstructured data
  • Applied CI/CD practices (Azure DevOps, GitHub) and automated model deployments using Prompt Flow & Kubernetes
  • Oversaw compute instances, clusters, and Azure Kubernetes deployments, supporting model registration, Prompt Flow jobs, and custom Docker scripts for environment configuration
  • Worked cross-functionally with data scientists and business stakeholders to gather requirements and translate them into robust data engineering solutions
  • Implemented monitoring and performance logging to enable data quality validation and root-cause troubleshooting

Graduate Teaching Assistant - Big Data

Trent University 2023 - 2024
  • Assisted the course director in teaching Introduction to Databases, Introduction to Data Science, and Data Visualization courses, fostering engagement among 160 students
  • Mentored students in SQL (MySQL and PostgreSQL), NoSQL (MongoDB), and machine learning pipelines, achieving a 99% student success rate
  • Supported faculty research projects, using R for data analysis and presenting findings to stakeholders

Software Engineer

Capgemini 2021 - 2022
  • Migrated 85% of custom SQL databases from a Java-based architecture to optimize data storage in product lifecycle development
  • Improved query execution speed, enabling faster data validation and enhancing storage optimization

Projects & Publication

NYC Taxi Analysis using Synapse

Built a real-time data pipeline processing 1M+ events using Azure Synapse Analytics

Azure Synapse, Analysis, T-SQL

Backorder Prediction

Developed and hosted a web application to analyze and predict product backorders in the warehouse

Python, SQL, Flask, AWS S3, AWS CodePipeline, AWS Elastic Beanstalk

Crime Analysis Application

Developed a Streamlit dashboard to analyze crime data for the city of Chicago

Python, Streamlit, Docker, GCP BigQuery, SQL, DBSCAN Clustering, Dashboard

Flight Delay Prediction

Created a web application that predicts whether a flight will be delayed, based on user input

Python, Flask, Random Forest Classifier

Certifications

Udemy

Azure Databricks & Spark For Data Engineers: Hands-on Project

Udemy

Azure Synapse Analytics For Data Engineers: Hands-on Project

Udemy

Azure Data Factory For Data Engineers

AWS

AWS Certified Cloud Practitioner


Let's Connect

Get in Touch

devvratmungekar53@gmail.com
+1 (249) 876-6843
Calgary, AB

Let's Collaborate! Send me a Message