PhD Researcher · ML Engineer · Multi-modal AI Specialist
Building large-scale vision-language systems and internet-scale data pipelines. First-author publications at ICCV 2025 and CVPR 2026. Engineering consultant at TII Abu Dhabi and PhD researcher at Deakin University.
About
I am a PhD researcher at Deakin University and an Engineering Consultant at the Technology Innovation Institute (TII, UAE). I specialise in multi-modal AI — the intersection of vision, language, and speech. My work spans architecting internet-scale data pipelines, training large vision-language models on GCP/AWS, building VQA benchmarks, and publishing at top-tier venues including ICCV and CVPR. I hold a BSc in Mathematics and an MSc in Data Science (GPA 86%) from Deakin University.
Core Competencies
Vision-Language Models (VLMs), LLMs, Visual Question Answering, Multi-modal Reasoning, Speech-Vision-Language
PyTorch, TensorFlow, Keras, HuggingFace Transformers, Weights & Biases
OCR Pipelines, ETL, Web Crawling, Deduplication, LLM Filtering, SFT Data Generation, Agent-Based Pipelines
AWS, GCP, Docker, Linux, Slurm, Flask, React.js, Elasticsearch
Python, JavaScript, SQL, R, C++
Annotation Systems, STEM-VQA, Distributed Training, Benchmark Evaluation
Experience
Education