Fahad Mattoo

Fahad ul Hassan Asif Mattoo

Senior Software Engineer building AI agents, LLM-powered systems, and generative AI applications.

Fahad Mattoo

About Me

I'm a Senior Software Engineer at Emumba in Islamabad, where I work on AI agent systems and large language model-powered applications.

At Emumba, I'm building Avtar, an avatar-based personal AI assistant, and Chatterbox, a multi-agent chatbot system. My work spans LangGraph-based agentic workflows, multi-agent orchestration with ~20 integrated tools, real-time voice processing with LiveKit, computer vision pipelines using MTCNN and rembg, and memory retrieval optimization with Mem0 and pgvector. I'm also implementing RAG pipelines for legal document processing — including contract analysis and document hierarchy detection — as well as integrating TTS services and the Hedra API for avatar face generation.

Previously, I worked as a Machine Learning Engineer at DCube Tech, where I built RAG-based chatbots, a multi-user multi-document chat engine, and an end-to-end structural prediction system for API call payloads. Before that, I worked as a Data Scientist at PackageX on Named Entity Recognition pipelines, as a Research Assistant at NCRA-RDDL applying traditional ML techniques across diverse domains, as a Teaching Assistant at NUST-EME in the Mechatronics department, and as a Management Associate at CBL overseeing the Mechanical Workshop department.

Interests

I'm deeply passionate about the evolving landscape of Generative AI and autonomous systems. My current areas of interest include:

  • AI Agents & Multi-Agent Systems — designing goal-driven agents that reason, plan, and collaborate to solve complex tasks using frameworks like LangGraph and LangChain.
  • Model Context Protocol (MCP) — building and integrating MCP servers to extend agent capabilities with external tools and data sources.
  • Agentic RAG — moving beyond static retrieval pipelines toward agents that adaptively query, reason over, and synthesize information from multiple knowledge sources.
  • LLM Orchestration — exploring patterns for memory management, structured outputs, tool use, and long-horizon planning in production-grade LLM systems.

Deep Learning Setup

GPU RTX 3060 12 GB
CPU Ryzen 3600
RAM 16 GB DDR4

Research

Published papers and ongoing research in machine learning, deep learning, and applied AI.

Masters Thesis
Masters Thesis

Development Projects

Professional work at DCube, NCRA, and PackageX.

JSON Generation Search Bot Using GPT

JSON Generation Chatbot Using Open Source LLMs

RAG Search Chatbot for Website Data (GPT)

RAG Chatbot (Langchain)

Multi-user RAG Based Chatbot

Human Activity Recognition Using YOLOv7

LayoutLMv2 for Named Entity Recognition

OCR

Mobile Device OCR Testing

EDA

Other Projects

Coursework, side projects, and unpublished research.

Bottle Level Measure

2D to 3D Image Generator

Self Driving Car

Photography

Photography collage

Other than studying like a book worm and working like a nerd, I can also draw a bit and take a few photographs.