Laksh Krishna Sharma

AI Engineer

Delhi, India

Specializing in scalable AI solutions with expertise in full-stack development, machine learning, and Generative AI. Focused on optimizing costs, architecting robust APIs, and deploying production-ready applications on cloud platforms.

Professional Experience

My career journey in AI engineering and software development

AI Engineer Intern

Aiseer Intelligence Systems Pvt. Ltd. (TrustAstrology.ai)

Nov 2025 – Present

Gurugram, India

  • Led LLM cost optimization, reducing token consumption by 35–45% through prompt compression, JSON to TOON migration and context summarization.
  • Migrated AI workloads from Google Gemini to AWS Bedrock, cutting inference costs by ˜30%.
  • Designed context-engineering pipelines to summarize conversations with 50%+ lower context.
  • Architected in-house Astro Engine API, replacing paid providers, reducing API costs by 85%+.
  • Built GraphQL APIs using FastAPI and pyswisseph for Vedic astrology at production scale.
  • Designed AWS infrastructure (VPC, subnets, NAT Gateway, ALB) for Report and Astro Engine APIs.
  • Containerized with Docker, published to ECR, deployed via ECS Fargate for horizontal scaling.
  • Built revenue-generating report platform using Next.js and Playwright for paid sales.
  • Optimized report prompts, lowering per-report cost by ˜25%.
  • Scaled async processing with BullMQ and Redis for high-throughput execution.
  • Deployed on AWS EC2 with Nginx, PM2, Bastion Host, ALB for secure access.

AI Engineer Intern

Ant Creatives

Jun 2025 – Sep 2025

Remote

  • Designed agentic RAG AI assistant for SuperSalesIQ CRM, enabling queries over 10k+ leads.
  • Built pipelines using LangChain, LangGraph, FastAPI, Milvus, MongoDB, Docker, achieving ˜30% faster responses.
  • Engineered vector-search workflows, improving accuracy by 25%+.
  • Developed backend for SuperConnect, syncing 1k+ LinkedIn connections/session to Sheets.
  • Designed authentication, extraction, webhook layers for real-time CRM enrichment.

Full-Stack Developer Intern

VENUEMONK

Dec 2024 – Mar 2025

Gurugram, India

  • Modernized APIs using async/await, improving throughput by 40%.
  • Migrated Flask to FastAPI, reducing latency by ˜25%.
  • Integrated LLM endpoints with LangChain, reducing manual processing by 20%+.
  • Refactored backend, cutting development turnaround by ˜30%.

Open Source Contributions

My contributions to the open source community

Apache Airflow

View PR

Deprecated wait policy in favor of wait for completion, aligning with Amazon operator standards.

Apache Airflow

View PR

Fixed EmrCreateJobFlowOperator deferral logic to respect wait policies, ensuring submit-only jobs no longer defer unintentionally.

Apache Iceberg

View PR

Contributed to the apache/iceberg-python repository by enhancing the test_version_format() function, fixing error messages, and adding additional tests to provide clearer guidance for version mismatches. This update helps users effectively troubleshoot version conflicts.

MDAnalysis

View PR

Contributed to the MDAnalysis/mdanalysis repository by enhancing the AtomGroup.unwrap() method, fixing error messages to provide clearer guidance for missing bond definitions. This update helps users effectively resolve issues related to undefined bonds.

Publications

Research papers and academic contributions

A Comprehensive Survey in ANN-based Customer Churn Prediction

Published in ICDAM-2025

Read Paper

A comprehensive survey exploring Artificial Neural Network approaches for customer churn prediction, analyzing various methodologies and their effectiveness in predicting customer behavior patterns.

Technical Skills

Technologies and tools I use to build robust solutions

Programming Languages

C/C++PythonTypeScriptJavaScriptGo

AI & Machine Learning

TensorFlowScikit-learnLangChainLangGraphLangSmith

Backend & Frontend

FastAPIGraphQLExpress.jsChiGinReact.jsNext.jsRedux

Databases

PostgreSQLMongoDBMilvusRedis

Message Queues

BullMQRabbitMQCelery

Cloud & DevOps

AWS (EC2, ECS, ECR, VPC, NAT, ALB)GCPDockerNginxKubernetesJenkinsGitHub Actions

Projects

HearU

AI-powered confidential mental wellness companion to write journals, featuring a FastAPI backend, React.js frontend, voice asistant with Gemini, Github Actions CI/CD and Kubernetes deployment on GCP.

View

BlogCast-AI

BlogCast-AI converts video content into Blogs, making it easy to share insights and knowledge in a written format with help of 2 AI agents, one for video research and another to write blogs, BlogCast-AI streamlines the process of content creation.

View

RAGVerse

A modular pipeline that scrapes web content, embeds it using state-of-the-art embeddings, stores the embeddings in Astra DB, and serves contextual answers using Groq-hosted LLMs through LangChain.

View

Neural Network from Scratch

Implemented a two-layer neural network in C++ on MNIST, including forward propagation, backward propagation and gradient descent.

View

Spam Detection

Multi-lingual spam classifier using Logistic Regression, Tfidf, and async task queue with Celery.

View

QueryQuill

AI chatbot that processes 75+ documents, extracts insights, and answers with citations.

View

NaviEyes

NaviEyes aims to change the narrative by offering a real-time, immersive, and intuitive solution. Powered by Groq's ultra-fast inference capabilities, NaviEyes leverages cutting-edge technology to bridge the gap between what the visually impaired can’t see and the world they deserve to experience.

View

Quantitative-Trading-Strategies-ML-Optimization

Developed and optimized machine learning models for trading strategies using historical market data.

View

Certifications

Contact Me