Laksh Krishna Sharma
AI Engineer
Delhi, India
Specializing in scalable AI solutions with expertise in full-stack development, machine learning, and Generative AI. Focused on optimizing costs, architecting robust APIs, and deploying production-ready applications on cloud platforms.
Professional Experience
My career journey in AI engineering and software development
AI Engineer Intern
Aiseer Intelligence Systems Pvt. Ltd. (TrustAstrology.ai)
Nov 2025 – Present
Gurugram, India
- Led LLM cost optimization, reducing token consumption by 35–45% through prompt compression, JSON to TOON migration and context summarization.
- Migrated AI workloads from Google Gemini to AWS Bedrock, cutting inference costs by ˜30%.
- Designed context-engineering pipelines to summarize conversations with 50%+ lower context.
- Architected in-house Astro Engine API, replacing paid providers, reducing API costs by 85%+.
- Built GraphQL APIs using FastAPI and pyswisseph for Vedic astrology at production scale.
- Designed AWS infrastructure (VPC, subnets, NAT Gateway, ALB) for Report and Astro Engine APIs.
- Containerized with Docker, published to ECR, deployed via ECS Fargate for horizontal scaling.
- Built revenue-generating report platform using Next.js and Playwright for paid sales.
- Optimized report prompts, lowering per-report cost by ˜25%.
- Scaled async processing with BullMQ and Redis for high-throughput execution.
- Deployed on AWS EC2 with Nginx, PM2, Bastion Host, ALB for secure access.
AI Engineer Intern
Ant Creatives
Jun 2025 – Sep 2025
Remote
- Designed agentic RAG AI assistant for SuperSalesIQ CRM, enabling queries over 10k+ leads.
- Built pipelines using LangChain, LangGraph, FastAPI, Milvus, MongoDB, Docker, achieving ˜30% faster responses.
- Engineered vector-search workflows, improving accuracy by 25%+.
- Developed backend for SuperConnect, syncing 1k+ LinkedIn connections/session to Sheets.
- Designed authentication, extraction, webhook layers for real-time CRM enrichment.
Full-Stack Developer Intern
VENUEMONK
Dec 2024 – Mar 2025
Gurugram, India
- Modernized APIs using async/await, improving throughput by 40%.
- Migrated Flask to FastAPI, reducing latency by ˜25%.
- Integrated LLM endpoints with LangChain, reducing manual processing by 20%+.
- Refactored backend, cutting development turnaround by ˜30%.
Open Source Contributions
My contributions to the open source community
Apache Airflow
View PRDeprecated wait policy in favor of wait for completion, aligning with Amazon operator standards.
Apache Airflow
View PRFixed EmrCreateJobFlowOperator deferral logic to respect wait policies, ensuring submit-only jobs no longer defer unintentionally.
Apache Iceberg
View PRContributed to the apache/iceberg-python repository by enhancing the test_version_format() function, fixing error messages, and adding additional tests to provide clearer guidance for version mismatches. This update helps users effectively troubleshoot version conflicts.
MDAnalysis
View PRContributed to the MDAnalysis/mdanalysis repository by enhancing the AtomGroup.unwrap() method, fixing error messages to provide clearer guidance for missing bond definitions. This update helps users effectively resolve issues related to undefined bonds.
Publications
Research papers and academic contributions
A Comprehensive Survey in ANN-based Customer Churn Prediction
Published in ICDAM-2025
A comprehensive survey exploring Artificial Neural Network approaches for customer churn prediction, analyzing various methodologies and their effectiveness in predicting customer behavior patterns.
Technical Skills
Technologies and tools I use to build robust solutions
Programming Languages
AI & Machine Learning
Backend & Frontend
Databases
Message Queues
Cloud & DevOps
Projects
HearU
AI-powered confidential mental wellness companion to write journals, featuring a FastAPI backend, React.js frontend, voice asistant with Gemini, Github Actions CI/CD and Kubernetes deployment on GCP.
ViewBlogCast-AI
BlogCast-AI converts video content into Blogs, making it easy to share insights and knowledge in a written format with help of 2 AI agents, one for video research and another to write blogs, BlogCast-AI streamlines the process of content creation.
ViewRAGVerse
A modular pipeline that scrapes web content, embeds it using state-of-the-art embeddings, stores the embeddings in Astra DB, and serves contextual answers using Groq-hosted LLMs through LangChain.
ViewNeural Network from Scratch
Implemented a two-layer neural network in C++ on MNIST, including forward propagation, backward propagation and gradient descent.
ViewSpam Detection
Multi-lingual spam classifier using Logistic Regression, Tfidf, and async task queue with Celery.
ViewQueryQuill
AI chatbot that processes 75+ documents, extracts insights, and answers with citations.
ViewNaviEyes
NaviEyes aims to change the narrative by offering a real-time, immersive, and intuitive solution. Powered by Groq's ultra-fast inference capabilities, NaviEyes leverages cutting-edge technology to bridge the gap between what the visually impaired can’t see and the world they deserve to experience.
ViewQuantitative-Trading-Strategies-ML-Optimization
Developed and optimized machine learning models for trading strategies using historical market data.
View