Bhanu Kedhar Sunku — AI/ML Engineer & Researcher

What I work with

Skills & Tech Stack

From research to production — end-to-end Agentic-AI and ML engineering across the full stack.

🤖

LLMs & Agents

LangChain LangGraph RAG Multi-Agent Prompt Engineering PEFT / LoRA Fine-Tuning OpenAI Claude API MCP Hugging Face N8N Tray.io LLM-as-Judge

🧠

ML & Deep Learning

PyTorch Scikit-Learn XGBoost Transformers Gradient Boosting Random Forest OpenCV Pandas D3.js

⚡

Inference & MLOps

vLLM Ray Triton Kubeflow Distributed Inference MLflow FastAPI REST APIs FAISS Pinecone ChromaDB RAGAS DeepEval

🌩️

Cloud & DevOps

AWS SageMaker GCP Docker Kubernetes CI/CD Git Jenkins Tableau

🐍

Programming Languages

Python SQL JavaScript Bash

Where I've worked

Experience

From production ML systems to HPC research infrastructure — one role, many hats.

Jan 2025 — Dec 2025

Northern Illinois University

Graduate Researcher

Working with Dr. Alhoori on LLM-driven scientific figure analysis. Reconstructed scientific flowcharts with 90% accuracy across 1.4M figures using a Chain-of-Events, RAG and multi-agent pipeline (Phi-4, Qwen2.5-VL, InternVL, Llama4-Scout-as-judge). Deployed and maintained multi-node Ray + vLLM inference on NIU Metis HPC across 32 A100 GPUs. Also serving as Teaching Assistant for CSCI 502 (Java) and CSCI 503 (Python), mentoring 40+ graduate and undergraduate students.

LangGraph RAG vLLM Ray Phi-4 Qwen2.5-VL LLM-as-Judge HPC A100 GPUs

May 2024 — Dec 2024

Elevance Health

Data Analytics

Shipped a classification system auditing 144K+ reports at 95% accuracy, cutting manual review by 60% and eliminating SLA penalty exposure. Lifted minority-class recall by 20% via failure-mode feature engineering, k-fold CV, and hyperparameter tuning. Built D3.js dashboards tracking pipeline-stage progress — adopted as the default daily artifact for the ops team. Deployed on SageMaker and Kubeflow.

Scikit-Learn SageMaker Kubeflow XGBoost D3.js REST API Python

Mar 2021 — Aug 2021

Alation

DevOps Engineer Intern

Reduced instance costs by 30% by migrating from MacStadium Anka to AWS Mac VMs. Built Jenkins CI/CD pipelines with auto-scaling and load-balancing for Docker-based environments, optimizing resource utilization at peak demand.

AWS Jenkins Docker CI/CD Auto-scaling

Dec 2020 — Mar 2021

Axcess.io

Cloud Engineer Trainee

Maintained and ensured uptime of multiple web applications using robust cloud infrastructure and monitoring techniques. Addressed DDoS attacks to safeguard websites against potential threats, and worked with cross-functional teams for continuous deployment.

Cloud Infrastructure Web Applications DDoS Mitigation CI/CD Monitoring

Research & Writing

Publications

Peer-reviewed research in visual analytics and human-computer interaction.

Under Review

FluidViews: Adaptive Drag-and-Drop Token Filters for Heterogeneous Multi-View Visual Analytics

Bhanu Sunku · Northern Illinois University

Interactive visual analytics workflows are often disrupted by rigid filter panels and context switches that break analysts' cognitive flow. FluidViews elevates filters to first-class, manipulable objects through two novel direct-manipulation interactions — Copy-as-Highlight for persistent cross-view comparison, and Drag-as-Filter to apply context-sensitive filters in place with no menus or modal dialogs required. A user study showed it eliminated cross-view navigation overhead vs. baseline dashboards.

Multi-View Visual Analytics Direct Manipulation Drag-and-Drop HCI D3.js React

📄 Read Paper 🎬 Watch Demo 💻 GitHub

What I've built

Featured Projects

A selection of Agentic, ML systems, research prototypes, and production work.

🔬

GitHub ↗

LLM-Driven Scientific Figure Analysis & Flowchart Reconstruction Pipeline

Built a framework processing ~110K arXiv papers and extracting ~1.4M figures with captions via PDFFigures2.0, GROBID, and PaddleOCR. Fine-tuned Llama4-Scout and Qwen2.5-VL with LoRA/PEFT on SPIQA and ChartLlama datasets. Designed a multi-agent orchestration pipeline (Phi-4, Qwen2.5-VL, InternVL) with Llama4-Scout as LLM-as-judge and HITL checkpoints. Built a flowchart reconstruction pipeline using Claude 3.5 Sonnet with structured outputs and RAG to generate Chain-of-Events (CoE) representations — achieving 90% accuracy.

Phi-4Qwen2.5-VLClaude 3.5 SonnetRAGLoRA/PEFTLLM-as-JudgevLLMRay

⚖️

GitHub ↗

Obesity Prediction: Leveraging LLM and Advanced Machine Learning Techniques

Implemented a majority-vote ensemble combining Decision Trees, Random Forest, SVC, KNN, and GPT-4 with K-fold CV and Optuna hyperparameter tuning — boosting accuracy from 94% to 98% while mitigating overfitting. Applied feature engineering, StandardScaler, and SMOTE class balancing for dataset robustness; evaluated with accuracy, weighted F1-score, recall, and precision.

GPT-4Scikit-LearnXGBoostOptunaSMOTERandom ForestK-fold CV

🌸

Live Demo ↗

Movies in Bloom: A Floral Glyph Visualization of Film Data

A custom D3.js glyph visualization that represents movies as flowers. Each flower encodes three variables simultaneously through visual form: the number of petals is scaled by the number of votes, the overall flower size reflects the movie rating, and the petal shape represents the MPAA rating category (G, PG, PG-13, R). A pleasing, scannable alternative to traditional bar charts for multivariate film data.

D3.jsObservable HQSVGCustom GlyphsData Visualization

🗺️

Live Demo ↗

GrantsFlow: Visualizing Louisiana's Post-Katrina Road Home Recovery

An interactive D3.js dashboard analyzing ProPublica's Road Home dataset — individual-level grants disbursed to Louisiana homeowners after Hurricanes Katrina and Rita. Built a custom D3 HexGrid map of Louisiana showing average damage per hex bin with brushing for region selection. Designed two novel custom glyphs: a Home glyph where 8 layered triangles encode the repair-cost percentage of total damage (with a blue back-area for rebuild cost), and a Chimney stacked bar showing cumulative distribution across the four grant types — Compensation, Additional Compensation, Elevation, and Individual Mitigation Measure.

D3.jsD3-HexGridObservable HQPythonGeoPandasData Visualization

Bhanu Kedhar
Sunku

Skills & Tech Stack

Experience

Publications

FluidViews: Adaptive Drag-and-Drop Token Filters for Heterogeneous Multi-View Visual Analytics

Featured Projects

Blog

Get In Touch

Bhanu KedharSunku

Skills & Tech Stack

Experience

Publications

FluidViews: Adaptive Drag-and-Drop Token Filters for Heterogeneous Multi-View Visual Analytics

Featured Projects

Blog

Get In Touch

Bhanu Kedhar
Sunku