Deepak Pathak
🔬

Hi, I'm

Deepak Pathak

Machine Learning Engineer & AI Researcher at DFKI

Machine Learning Engineer and AI Researcher at DFKI working on multimodal ML, earth observation, privacy auditing, and production ML systems.

About

I work across ML research and production engineering, with a focus on systems that need to be technically sound and operationally reliable.

At DFKI, my work includes multimodal crop-yield prediction from satellite, weather, and geospatial data; privacy auditing for ML models and LLMs in regulated settings; and MLOps infrastructure for reproducible, deployable workflows.

The work has led to publications in remote sensing and machine learning venues, along with production pipelines used with partners in Germany. Before DFKI, I spent four years at IBM building distributed telecom systems, and that engineering background still shapes how I approach projects.

Full background →

Experience

AI Researcher

DFKI — SmartCut

Coordinate DFKI-side execution for SmartCut, working with project partners while building Kedro pipelines for harvester-data harmonization and Earth Observation acquisition. Develop Apache Airflow DAGs for raw-data monitoring and automated pipeline execution for field operations.

AI Researcher

DFKI — MissionKI

Developed privacy auditing workflows for classification models and LLMs using membership inference and model inversion evaluations. Automated MLflow-based testing and reporting for healthcare AI compliance, and supported domain-specific LLM fine-tuning and privacy evaluation.

AI Researcher

DFKI — Yield Consortium

Developed multimodal crop-yield prediction systems across satellite, weather, soil, and geospatial features. Built preprocessing pipelines, Dash dashboards, and MLflow/GitLab CI/CD/Docker workflows used with partners in Germany.

Machine Learning Research Intern

Miele & Cie. KG

Built deep metric, contrastive, and self-supervised vision models for fine-grained classification, using PyTorch Lightning and Azure Databricks for scalable training.

Student Research Assistant

virtUOS, University of Osnabrück

Contributed to the SIDDATA digital assistant by supporting backend infrastructure and neural recommender integrations.

Working Student — Software Developer

Aitech Concept UG

Implemented TensorFlow object detection models and Django applications for real-time tracking from surveillance feeds.

Application Developer

IBM

Developed distributed Java EE services in a telecom SOA environment and built Oracle BPM workflows deployed on Oracle WebLogic Server. Also implemented Spring Boot and Twilio automation for incident flagging to speed operational response.

Education

M.Sc., Cognitive Science

Osnabrück University

Specialization in machine learning, computer vision, and data science.

B.Tech, Electronics Engineering

HBTI Kanpur (UPTU)

Technical Skills

Frameworks, platforms, and methods I use most often

Machine Learning

PyTorch
expert
Hugging Face
advanced
TensorFlow
intermediate
Multimodal Learning
expert
LLM Fine-tuning
advanced

MLOps & Pipelines

Python
expert
Kedro
expert
MLflow
expert
Apache Airflow
advanced
Docker
advanced

Cloud & DevOps

GitLab CI/CD
advanced
AWS SageMaker
advanced
Azure Databricks
intermediate
Linux
advanced
SQL
advanced

Remote Sensing & Trustworthy AI

Sentinel-2
expert
Earth Observation
expert
Computer Vision
advanced
Privacy Auditing
advanced
Selected Publications
(2025). Intrinsic explainability of multimodal learning for crop yield simulation. Computers and Electronics in Agriculture, Vol. 239, 2025.

Contact

Contact and profiles

Based in

DFKI GmbH

Kaiserslautern, Germany

View on Map