About

Here is a little background

Hey 👋🏼 I’m Farhan a Software Engineer based in Mumbai, India. I did my undergraduate in Electronics Engineering at Mumbai University. I love building AI that solves real problems the kind that works quietly, scales reliably, and feels simple to use. My work spans data pipelines, machine learning systems, and generative AI products. If it involves models, data, and clean code, that’s where I want to be. Outside of tech, I thrive on strength and discipline. I love lifting heavy 🏋️ and I’ve earned a black belt in Karate 🥋 after years of training, focus, and commitment to the craft.

Experience

Capgemini

AI Pre-Sales Engineer

Capgemini

PythonAzureFastAPILangChain

2023 - Present

  • Led AI solutioning and pre-sales technical architecture for agentic development frameworks.
  • Designed prompt engineering strategies and agent orchestration workflows for compliance automation.
  • Created comprehensive generative AI suites combining Code Converter, Mermaid Diagram Creator, and Model Validator.
  • Achieved 40-60% reduction in development and validation cycles through metamodel orchestration.
Capgemini

Data Scientist

Capgemini

PythonAzureFastAPILangChain

2023 - 2023

  • Developed RAG-based generative AI systems integrating LangChain, Vector DBs, and LLM orchestration.
  • Built FastAPI-based microservices for real-time inference and model serving, supporting 10,000+ daily API calls.
  • Implemented automated data validation frameworks ensuring compliance.
  • Optimized machine learning pipelines for scalability and performance.
Capgemini

Data Engineer

Capgemini

AzureSparkKafka

2021 - 2022

  • Built and maintained SoC-based Azure Data Pipelines using Azure Data Factory and PySpark.
  • Integrated data from embedded SoC modules into Azure Data Lake for regulatory analytics.
  • Developed Power BI dashboards for telemetry insights across IoT devices.
  • Designed scalable ETL workflows supporting downstream machine learning pipelines, improving forecast accuracy by 35%.
  • Implemented Kafka-based event streaming architecture for real-time data ingestion.
  • Optimized Databricks clusters and PySpark jobs, reducing cloud infrastructure costs by 28%.
Capgemini

Software Engineer

Capgemini

PythonOpenCVSQL

2019 - 2021

  • Designed OCR pipelines with Tesseract and OpenCV for multilingual degradation detection.
  • Automated regression workflows for PLM systems, accelerating release cycles by 3 weeks.
  • Developed an Azure Data Lake pipeline for nutritional analysis with 89% accuracy.
  • Created Power BI dashboards and regression models to visualize and forecast product stability.
  • Implemented automated data validation frameworks ensuring compliance.
  • Built Python-based data quality monitoring tools, detecting anomalies within 15 minutes.

Skills

Projects

Project Image 1

Project 1: Portfolio Website

Next.jsTailwindTypeScriptFramer Motion

I built this portfolio to master the modern React ecosystem, combining Next.js, TypeScript, Tailwind CSS, and Framer Motion into a seamless digital experience. Visually inspired by Mitchell Sparrow, adapted and built by Farhan Mallick. The design focuses on clean aesthetics and fluid interactivity. To ensure maximum performance and reliability, I architected the system to run on a lightweight local data layer, eliminating external dependencies while maintaining easy content manageability.

Insurance Underwriting Risk Engine

Project 2: Insurance Underwriting Risk Engine

PythonAzureML

An ML system that ingests medical reports to predict policy risk categories (Low, Medium, High). Features feature engineering, model calibration, and batch scoring pipelines to assist underwriters.

Contact

I have got just what you need. Lets talk.

+91-9137941850

frn315@gmail.com

Mumbai, India