Sankalp

Sankalp's Portfolio Assistant

Online ยท trained on resume & projects

Replies use a Vercel serverless function ยท GROQ_API_KEY stays server-side

Places I've travelled

A clean travel shelf for the countries I have visited, with space to drop in one favorite photo from each place.

8Countries
1Photo Each

United States of America

Coming soon...Travel photo

India

Coming soon...Travel photo

France

Coming soon...Travel photo

Italy

Coming soon...Travel photo

Switzerland

Coming soon...Travel photo

Germany

Coming soon...Travel photo

Netherlands

Coming soon...Travel photo

Austria

Coming soon...Travel photo

Travel photos coming soon.

Sankalp Kulkarni

Things I've built & shipped

Production systems, applied research, and the kind of work that ships on Friday and stays up on Monday. Tap any card to flip โ†’

1.5+
Years
220K+
Embeddings
3.81
GPA
๐Ÿš€
Feb 2026 โ†’ Present Now

Stealth AI Startup

Software Developer Intern ยท Backend / AI ยท Remote

Designed and shipped REST API endpoints for user onboarding and profile standardization in a consumer app for physical-trade job matching. Owned backend development across all application components for the startup's physical AI product.

AI Backend pgvector Node.js
Flip
What I'm doingStealth Physical AI Startup ยท Backend
  • Designed and shipped REST API endpoints for user onboarding and profile standardization in a consumer app for physical-trade job matching.
  • Owned backend development across application components for the startup's physical AI product, including API contracts, service logic, data flow, and integrations.
  • Built a semantic role classification pipeline with pgvector and sentence embeddings mapping titles to ~900 O*NET occupations - 95% accuracy in offline eval.
  • Chose pgvector over a standalone vector DB to reduce ops complexity and keep embeddings colocated with profile data.
  • Wrote PyTest unit + integration tests and added structured logging; aligned roadmap weekly with EM and founder.
  • Collaborated with a 2-person frontend team to define API contracts and ship against weekly milestones.
โš™๏ธ
Jul โ†’ Sep 2025 โ€ข Chicago, IL

Quinnox Inc. / Qyrus

AI Engineer Intern

Built a production RAG system indexing 220K+ embeddings. Improved precision@5 by 22% via CLIP + Cohere ReRank v3. Fault-tolerant ingestion at 20K+ items/hour on AWS.

RAG AWS Pinecone FastAPI
Flip
What I shippedQyrus RAG Platform
  • Built and deployed a production RAG system (FastAPI, Redis, Pinecone, Docker) indexing 220K+ embeddings for low-latency search.
  • Designed embedding + reranking workflows with CLIP and Cohere ReRank v3 - +22% precision@5 while balancing latency and cost.
  • Architected a fault-tolerant async pipeline on AWS (S3, SQS, async workers, batch upserts) with backpressure and retry logic - 20K+ items/hour.
  • Sustained 300+ vectors/min ingestion throughput with structured metrics and distributed logs.
  • Integrated LLM generation with grounding, prompt templates, and fallback logic; validated end-to-end with PyTest + Postman.
๐Ÿ› ๏ธ
Jun 2023 โ†’ Jul 2024 โ€ข Bangalore, IN

Pcloudy / Opkey

Software Engineering Intern

Shipped Qpilot.AI MVP - automated test generation across 1,200+ OEM configs with 78% execution success and 90% reduction in test time. Kafka log pipeline at 33K+ events/min.

Flask Kafka AWS Docker
Flip
What I shippedQpilot.AI MVP
  • Designed and shipped the Qpilot.AI MVP (Flask, Python) - automated script generation and self-healing execution across 1,200+ OEM device configs.
  • Achieved 78% execution success and reduced end-to-end test time by 90%, significantly lowering manual intervention.
  • Built Python/Flask microservices on AWS with Kafka-based streaming log pipelines for real-time failure triage at 33K+ events/min.
  • Automated build, test, deploy via GitHub Actions; containerized with Docker for iterative rollouts.
  • Validated services with Postman + PyTest; added logging and monitoring for production reliability.
๐Ÿง 
2025 โ†’ Present Active

Sage

Multi-agent orchestrator & personal AI assistant

Local-first multi-agent system with RAG, persistent memory, live news, web search, WhatsApp delivery, Gmail triage, and proactive briefings - orchestrating specialized agents via Ollama tool-calling. No cloud, no tracking.

Python Ollama Multi-Agent FastAPI ChromaDB
Flip
Project deep-diveSage ยท Multi-Agent Orchestrator
  • Multi-agent orchestration via open-source Ollama tool-calling - a router LLM autonomously dispatches to specialized agents (web search, news, RAG, memory, reminders, Gmail) based on intent.
  • Local-first RAG pipeline over personal documents (PDF, Markdown, TXT) with ChromaDB vector search and selective retrieval - skips embedding overhead for casual chat.
  • WhatsApp integration via Twilio + FastAPI webhook: full chat, commands, and proactive reminder delivery with daily scheduling and missed-reminder recovery.
  • Gmail triage agent via Google OAuth - classifies inbox into ACTION / FYI / IGNORE with AI-generated summaries surfacing what needs a reply.
  • Persistent SQLite-backed sessions, facts, and todos with session resume, conversation analytics dashboard, and usage tracking.
๐Ÿ”Ž
CS242 โ€ข Information Retrieval

Fandom Wiki Search Engine

Search engine for fan-driven knowledge bases

Final project for CS242: Information Retrieval and Web Search. Indexes and retrieves relevant content across Fandom wikis so users can quickly find characters, lore, game mechanics, and community knowledge.

Search Ranking IR NLP
Flip
Project deep-diveFandom Wiki Search Engine
  • Built a specialized search engine across Fandom wikis for CS242: Information Retrieval and Web Search.
  • Indexed fan-driven knowledge bases so users can retrieve relevant information on characters, lore, game mechanics, and community content.
  • Implemented keyword-based retrieval and ranking techniques to return accurate, fast results for fandom-specific queries.
  • Designed around the needs of fandom communities, where search intent often depends on names, aliases, game terms, and lore-specific context.
๐Ÿ”
Apr 2025 โ†’ Present Pilot

SafeBot

Investigation platform for Riverside PD

Human-in-the-loop email triage and suspect-flagging with hybrid keyword rules + LLM risk scoring. Google OAuth, RBAC, audit logs. Deployed on GCP Cloud Run with CI/CD.

GCP LLM React PostgreSQL
Flip
Project deep-diveSafeBot ยท Applied AI for Investigations
  • Built a production investigation platform for Riverside Police Department processing 10K+ emails/hour for triage and suspect flagging.
  • Hybrid keyword rules + LLM-based risk scoring with RBAC, escalation policies, and review gates aligned to legal requirements.
  • Shipped REST APIs with PostgreSQL schema migrations and React + TypeScript dashboards with Google OAuth, admin console, and audit logs.
  • Deployed on GCP Cloud Run with Redis caching and autoscaling via GitHub Actions CI/CD.
  • Currently in pilot discussions for deployment to 5 RPD investigators.
๐Ÿ“ฑ
2023 โ€ข Published 2024

Android Log Search Utility

Real-time log parsing & classification

Parsed 250K+ Android log lines with failure classification. Published in the International Journal of Applied Engineering and Technology, London, 2024.

Node.js MongoDB Published
Flip
Project deep-diveAndroid Log Search Utility
  • Built a real-time Android log parsing and search system (Node.js, MongoDB) processing 250K+ log lines.
  • Added failure classification and recommendations for log lines, surfacing error patterns from noisy logs.
  • Improved debugging efficiency for engineers working across high-volume device fleets.
  • Work published as "Application Specific Log Parser" in the International Journal of Applied Engineering and Technology, London, 2024.
๐Ÿ“Š
2023

Test Device Recommendation Engine

ML-driven device matrix selection

Clustered 1K+ OEM configurations and ranked similarity over 125K+ log lines via TF-IDF. Replaced manual heuristics with data-driven scoring and cut device selection time to under 90 seconds.

ML TF-IDF Clustering
Flip
Project deep-diveTest Device Recommendation Engine
  • Built a recommendation system to automate test-device selection for QA teams managing massive device matrices.
  • Clustered 1K+ OEM configurations and analyzed 125K+ log lines using TF-IDF and similarity-based ranking.
  • Replaced manual heuristics with data-driven scoring - device selection time cut to under 90 seconds.
  • Integrated as a recommendation layer into the broader Pcloudy test automation platform.
๐ŸŽ“
Sept 2024 โ†’ Mar 2026

University of California, Riverside

MS in Computer Science
GPA 3.81 / 4.0
๐ŸŽ“
Dec 2020 โ†’ May 2024

PES University, Bangalore

BTech CS ยท Machine Intelligence & Data Science
GPA 8.3 / 10
๐Ÿ“„
2024โ€ขLondon

Application Specific Log Parser

International Journal of Applied Engineering and Technology

Real-time Android log parsing, failure classification, and search across 250K+ log lines - improving debugging efficiency for engineers working across high-volume device fleets.

View Paper

Let's build something.

Open to SWE / AI Engineer roles ยท United States