Dr. Phani
Siginamsetty
PhD-qualified Data Scientist and AI Engineer with 5+ years bridging cutting-edge academic research with enterprise-grade production systems. Expert in designing autonomous Multi-Agent Systems — where LLM-powered agents plan, reason, use tools, and collaborate to solve complex real-world workflows end-to-end. Deep hands-on experience across the full AI lifecycle: from RAG pipeline architecture and LLM fine-tuning (PEFT/QLoRA) to quantized edge deployment, MLOps, and scalable cloud infrastructure on AWS. Proven ability to translate research breakthroughs in multilingual NLP, multimodal AI, and quantum-motivated algorithms into production-ready systems — backed by 11 patents and 7 peer-reviewed publications in Elsevier, IEEE, and Springer. Passionate about building intelligent systems that are not just accurate, but autonomous, explainable, and deployable at scale.
Skills & Stack
Full-spectrum ML/AI stack — hover bars, click categories to explore.
Work Experience
5+ years across industry research, enterprise AI, and academia.
- Autonomous Fraud Detection: Spearheading real-time fraud detection using stateful multi-agent systems via the Agno framework with advanced tool-calling for complex transaction analysis.
- Advanced RAG Pipelines: Engineering a multi-agent RAG pipeline for "Smart Tutor" using vector databases and semantic routing to deliver personalized content while minimizing hallucinations.
- Enterprise Automation: Designing agentic workflows with LangChain and LangGraph to automate reporting with Human-in-the-Loop (HITL) mechanisms, reducing operational overhead.
- Edge GenAI & Quantization: Researched lightweight LLMs for on-device inference using GGUF/AWQ quantization to reduce memory footprint and latency on vehicular hardware.
- Computer Vision Diagnostics: Deployed an optimized CNN pipeline for real-time component recognition within the Vehicle Configuration Manager (VCM) to automate visual inspections.
- Healthcare AI: Architected a secure RAG chatbot for SRM Global Hospital to retrieve medical protocols while ensuring strict data privacy and proprietary data embedding.
- Audio Intelligence: Engineered a MoM automation API using FastAPI, STT, and speaker diarization to autonomously extract abstractive summaries and action items from recordings.
- Multilingual NLP: Developed MATSFT and MMSFT frameworks by fine-tuning mT5 for low-resource Indian languages, resulting in multiple high-impact journal publications.
- Quantum AI & IP: Architected quantum-motivated summarization processors for data compression, leading to multiple Indian Patents including 1 Granted Patent.
- Industrial Safety Vision: Deployed real-time object detection to monitor hazardous machinery, triggering emergency stops via spatial tracking of hand proximity to danger zones.
- Predictive Analytics: Developed ML models to forecast patient outcomes and translate clinical data into actionable insights for data-driven healthcare decisions.
- Software Mentorship: Instructed Data Structures, Algorithms, and Python, mentoring students in software engineering best practices and technical problem-solving.
Key Projects
Enterprise-grade AI systems built end-to-end — spanning GenAI, fraud detection, computer vision, and healthcare.
- Multi-format: PDF, DOCX, PPTX, PNG, JPG
- Auto-conversion of Word & PowerPoint to PDF
- Intelligent image extraction & spatial analysis
- Custom prompt instructions per document
- Module regeneration with new instructions
- Structured module generation via PHASE 1–3 analysis
- Interactive AI tutor with AWS Polly TTS (4 voices)
- Rich image explanations with analogies
- Auto question generation & answer validation
- Internet search integration for extended context
- Vector search via Pinecone + Titan Embed (1024-dim)
- Document sharing with edit proposals & review
- Course publishing for trainee access
- Real-time progress tracking during processing
- Markdown-supported module curation
- Roles: Super Admin, Trainer, Trainee
- Process-based org grouping (departments)
- Admin approval workflow for new trainers
- Configurable rate limits per role (SlowAPI)
- JWT auth with bcrypt password hashing
- TTS lectures with pause-for-questions
- Raise Hand feature during live sessions
- Voice input via Google Speech Recognition
- Quiz system with instant AI feedback
- Preview mode for trainers before publishing
- AWS RDS PostgreSQL + SQLAlchemy ORM
- S3 server-side AES256 encryption
- CORS + SQL injection protection
- Async operations via asyncio
- Rotating file handler logging
- Hybrid Risk Engine: Dual-layered system fusing statistical anomaly detection (XGBoost) with GenAI-driven forensics, reducing investigation time and false positives.
- Autonomous Rule Discovery: Multi-agent workflow for live transaction monitoring, detecting zero-day fraud patterns with Human-in-the-Loop oversight.
- Argus Agent: AI Data Analyst using
msoffcryptoto securely decrypt and parse sensitive financial datasets locally for evidence-based risk verdicts.
- Forensic Digitization: End-to-end vision pipeline using AWS Textract and OpenCV for layout analysis and digitization of MICR codes and payee details with high OCR accuracy.
- Signature Verification: PyTorch-based Siamese Neural Network for one-shot learning, using contrastive loss and feature embeddings to detect forged signatures.
- Cross-Modal Logic: NLP algorithms cross-verifying extracted semantic data (numeric vs. written amounts) to flag discrepancies for manual review.
- Clinical Guardrails & RAG: Domain-specific agent using Llama 3 and Vector DBs, with query expansion and re-ranking to ground answers exclusively in verified medical literature.
- Stateful Memory: Persistent context-retention engine using LangGraph and MongoDB to map longitudinal symptoms and medical history for personalized health insights.
Research & Patents
11 patents filed · 3 granted · 7 peer-reviewed publications in Elsevier, IEEE & Springer.
Education
A decade of academic excellence from SSC through PhD.