AI · Data · Systems Engineer
Vinay Gandhi
Built on data, driven by AI
15+ years across factory and field environments, now building Python/SQL data pipelines and applied AI — RAG, NLP, and ML — that turn raw telemetry and narratives into structured, queryable insight. Focused on data, AI, and factory systems.
About
Data first — AI and factory systems on top.
Data-focused, AI-driven engineer with 15+ years across factory and field environments. I build Python/SQL data pipelines and applied AI — RAG, NLP, and ML — that turn raw telemetry and free-text narratives into structured, queryable insight. Strong on data engineering and analytics, with factory-systems experience (MES, streaming, IIoT) underneath.
★ Flagship — Data + AI
Lead projects: free-text narratives and telemetry turned into structured, queryable data with applied AI.
QualityMind-RAG Data + AI
Conversational analytics over manufacturing data — Hybrid RAG (Pinecone) plus Text-to-SQL (Vanna 2.0, PostgreSQL), orchestrated with LangGraph agents to query structured + unstructured records.
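One common way to merge a dense (vector) result list with a keyword result list in a hybrid retriever is reciprocal rank fusion. The sketch below is illustrative only: the card does not say which fusion method QualityMind-RAG uses, and the document IDs and the `k=60` constant are assumptions.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked ID lists; each hit contributes 1 / (k + rank)."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical record IDs from a vector search and a keyword search.
dense = ["wo-17", "ncr-04", "ncr-09"]
sparse = ["ncr-04", "spc-22", "wo-17"]
fused = reciprocal_rank_fusion([dense, sparse])
print(fused)
```

Records that appear in both lists rise to the top, which is the usual motivation for fusing structured-keyword and semantic hits.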
ClaimLens Data + AI
Warranty-narrative NLP for field-quality RCA — classifies overcycle anomalies (Soft Reset, Cloud Sync) at macro-F1 0.90 on a 1,200-narrative synthetic eval and rolls free-text claims into Pareto failure trends that feed 5-Why / 8D.
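For readers unfamiliar with the two measures named above, here is a minimal sketch of macro-F1 and a Pareto roll-up in plain Python. The labels and predictions are hypothetical toy values, not the project's 1,200-narrative eval set, and the function names are illustrative.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over every label seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def pareto(labels):
    """Return (label, count, cumulative_share) rows, most frequent first."""
    counts = Counter(labels).most_common()
    total = sum(count for _, count in counts)
    rows, running = [], 0
    for label, count in counts:
        running += count
        rows.append((label, count, running / total))
    return rows


# Hypothetical labels for six narratives.
y_true = ["soft_reset", "soft_reset", "cloud_sync", "cloud_sync", "other", "other"]
y_pred = ["soft_reset", "cloud_sync", "cloud_sync", "cloud_sync", "other", "other"]
score = macro_f1(y_true, y_pred)
trend = pareto(y_pred)
```

Macro averaging weights every failure class equally, so a rare anomaly class counts as much as a common one.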
Automotive visual QA Data + AI
Vision inference pipeline — edge CLIP, AWS serverless path, MES-style FastAPI integration for inline image classification.
Projects
Grouped by pillar, Data and AI first — the field-quality & warranty work hiring managers care about. Filter below, or scroll all.
Data & Analytics
Warranty / field analytics · SQL & Python pipelines · lakehouses · streaming ingestion · SPC
Federated Data Mesh
Data / ELT & Analytics
Crypto data ETL
Data pipelines and extraction patterns for crypto-market-style analytical workloads.
Market Mood Ring
Market sentiment analytics — real-time data aggregation, scoring, and visualization across financial instruments.
Healthcare analytics
Large-scale analytics over healthcare datasets — big-data processing and feature pipelines.
LLSPIN
Healthcare research project — data analysis and modeling pipeline.
Streaming & Telemetry
Edge telemetry plane + Software
DETCP reference stack — MQTT → Rust edge (NATS JetStream) → gRPC → Go cloud (Kafka → TimescaleDB) for factory & fleet telemetry.
AEGIS · Foresight
Manufacturing correlation — PLC + MES fusion and streaming ML (ingest via edge-telemetry-plane).
AI Engineering
Factory-systems AI · ML · computer vision · agentic AI / RAG · robotics ML · forecasting
Factory Systems AI
AEGIS + Data
Manufacturing correlation engine — PLC telemetry, MES work orders, and streaming ML inference.
Factory Genius
Conveyance & rotating equipment PM: MQTT anomalies, BM25 RAG over manuals, optional LLM diagnosis, React dashboard.
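BM25 retrieval over manual text can be sketched in a few lines. This is a generic Okapi BM25 scorer under assumed defaults (`k1=1.5`, `b=0.75`, whitespace tokenization), not the project's code, and the manual snippets are made up for illustration.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each tokenized document against the query terms with Okapi BM25."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    # Document frequency: in how many documents each term appears.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log((n - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            norm = tf[term] + k1 * (1 - b + b * len(d) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Hypothetical manual chunks.
manual_chunks = [
    "conveyor belt tracking adjustment procedure",
    "gearbox bearing vibration limits and lubrication schedule",
    "motor bearing replacement torque values",
]
docs = [chunk.split() for chunk in manual_chunks]
scores = bm25_scores("bearing vibration".split(), docs)
best = max(range(len(docs)), key=scores.__getitem__)
```

The top-scoring chunk would then be passed as context to the optional LLM diagnosis step.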
Cell-to-Pack + Vision
EV battery assembly vision stack: 2.5D RGB+depth fusion, VLM-style inference API, PLC halt & MES audit trail.
Factory Systems AI platform: AEGIS, Factory Genius, and Cell-to-Pack form one factory-floor AI family (alongside digital-twin, factory-ops, and VisionGuard services). This family is the flagship roadmap for the next 6 months. Full story: factory-ai.html, featured in Factory systems & demos below.
Agentic AI & RAG
Warranty agent Multi-agent
Dealership payment adjudication — Intake → Policy → Fraud → Adjudicator pipeline with a deterministic route_claim() gate.
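A deterministic gate of this kind can be as simple as a pure function over the outputs of the earlier stages. The sketch below is a guess at the shape only: the `Claim` fields, thresholds, and route names are hypothetical, and only the name `route_claim` comes from the project description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Claim:
    amount: float       # claimed payment (hypothetical field)
    policy_ok: bool     # outcome of the Policy stage (hypothetical field)
    fraud_score: float  # 0..1 from the Fraud stage (hypothetical field)


def route_claim(claim, fraud_limit=0.7, auto_limit=500.0):
    """Rule-based gate: the same claim always yields the same route."""
    if not claim.policy_ok:
        return "reject"
    if claim.fraud_score >= fraud_limit:
        return "manual_review"
    if claim.amount <= auto_limit:
        return "auto_approve"
    return "adjudicator"


routes = [
    route_claim(Claim(120.0, True, 0.1)),
    route_claim(Claim(120.0, True, 0.9)),
    route_claim(Claim(4000.0, True, 0.2)),
    route_claim(Claim(80.0, False, 0.0)),
]
print(routes)
```

Keeping the gate free of model calls makes the routing decision reproducible and easy to audit, whatever the agents upstream produce.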
RAG QA system
Question answering over documents with retrieval-augmented generation patterns.
Data analytics AI agents
Agent-oriented flows for analytics workloads and structured reasoning over data.
Computer Vision & ML
AutoClaim-VLM
VLM vehicle-damage assessment ETL for insurance / fleet claims — Glue → SageMaker (PaliGemma 3B) → Redshift / DynamoDB with confidence-based routing. Latency / accuracy figures are projected design targets.
OCR Expense Intelligence + Product
AI-powered receipt OCR — extract merchant, date, totals, and line items; auto-categorize spend; dashboard analytics with CSV/Excel export. FARM-stack codebase (codename Extracta AI).
eCommerce Demand Forecasting
ARIMA(2,1,1) baseline upgraded to LSTM + TFT + N-BEATS ensemble — dbt feature mart, Airflow DAG, MLflow experiment tracking, FastAPI serving, Kafka streaming, and Prometheus drift monitoring.
Prism-Federated
Privacy-preserving distributed intelligence and federated learning for sensitive data silos.
Robotics & Embodied AI
VLA-bench + Data
Sim-to-real VLA fine-tuning profiler: WebDataset → FlashAttention-2 → FSDP. 3.3× throughput, −65% training cost per 500K-frame run.
RL-Pendulum
Sim-to-real RL pipeline: PPO policy trained in Gymnasium, exported to TFLite, deployed on ESP32 for physical pendulum balance.
Semantic SLAM Rover
Ground rover with semantic scene understanding — ROS 2 Humble, LiDAR SLAM, TensorRT FP16 on Jetson Orin Nano at ≥30 FPS.
Software / Factory Systems
Distributed backends · streaming infra · MES / EOL · EV / OTA · full-stack web
Streaming & Backend
EV & OTA
Overdrive OTA Manager
Enterprise fleet OTA orchestration — NATS control plane, S3 data plane, Go orchestrator, and React command center.
OTA firmware verifier
Signature & integrity verification for OTA firmware images before they reach vehicle ECUs.
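The two checks named above, integrity and authenticity, can be illustrated with the Python standard library. This sketch assumes a manifest that carries a SHA-256 digest and an authentication tag. It uses an HMAC as a symmetric stand-in for the asymmetric signature a production verifier would typically check, and the key, image bytes, and function name are hypothetical.

```python
import hashlib
import hmac


def verify_firmware(image: bytes, expected_sha256: str, tag: bytes, key: bytes) -> bool:
    """Check the image digest, then its HMAC tag, using constant-time compares."""
    digest = hashlib.sha256(image).hexdigest()
    if not hmac.compare_digest(digest, expected_sha256):
        return False  # integrity failure: bytes differ from the manifest
    expected_tag = hmac.new(key, image, hashlib.sha256).digest()
    return hmac.compare_digest(expected_tag, tag)


# Hypothetical key and image; a real deployment would not embed a key like this.
key = b"demo-key-not-for-production"
image = b"\x7fFIRMWARE v1.2.3"
manifest_digest = hashlib.sha256(image).hexdigest()
manifest_tag = hmac.new(key, image, hashlib.sha256).digest()

ok = verify_firmware(image, manifest_digest, manifest_tag, key)
tampered = verify_firmware(image + b"\x00", manifest_digest, manifest_tag, key)
print(ok, tampered)
```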
Key provisioning
Automotive key provisioning — secure key lifecycle and distribution for vehicle ECUs.
Full-stack & Web
OCR Expense Intelligence + AI
SaaS-style expense tracker — upload receipt photos, async OCR pipeline (EasyOCR + PyTorch), MongoDB persistence, React dashboard with spend analytics and exports.
Weather WebApp
Responsive weather application — live API integration and a clean front-end build.
SmartFound
Lost-and-found-style full-stack application with search and matching workflows.
HabitArc
Habit-tracking application — front-end state management and persistence patterns.
ToDo app (FARM stack)
Task manager built on the FARM stack (FastAPI · React · MongoDB).
MVP demos (Motel Portal, OCR Expense Intelligence, and more) are featured in Factory systems & demos below (full walkthroughs: mvp.html).
Product Management
PRDs · product demos · value-attribution dashboards
Product & Strategy
APEX-recover
PM-authored PRD and React/TypeScript prototype for the human-in-the-loop recovery workflow when a robot's autonomous policy fails on the factory floor.
TeleOp Flywheel Dashboard
Attribute exact dollar value to human-in-the-loop robotic teleoperation data — Streamlit dashboard surfacing operator quality at per-session granularity.
OCR Expense Intelligence + AI
B2B document-intelligence concept — receipt upload to structured expenses, categorization, and spending analytics; Dockerized FARM stack.
Browse everything on GitHub. Product demos live on MVP. Project case studies and EV references live in the interactive lab → Case studies.
Factory systems & demos
Deep-dive pages — flagship factory platform and live product MVPs.
Factory AI Platform
PLC + MES + streaming ML — architecture, tech stack, business value, and AEGIS build status.
View full breakdown →
MVP Demos
Live product walkthroughs — Motel Web Portal, OCR Expense Intelligence, and more full-stack demos over time.
Watch demos →
Theory & Programming
9 illustrated theory topics and 6 programming tracks — all hosted on this domain.