Hitesh Pattanayak HiteshRepo

Hi, I'm Hitesh 👋

Senior Software Engineer with 10+ years building scalable data pipelines, cloud-native infrastructure, and enterprise backend systems. Currently working on indexing and searching petabytes of M365 data.

🌐 hiteshpattanayak.info · AWS Community Builder · CKAD Certified

🛠 Tech Stack

Languages: Go · Python · TypeScript / Node.js · PySpark
Data Engineering: Databricks · Apache Spark · Delta Lake · Azure Event Hubs
Cloud & Infra: Kubernetes · Docker · Azure · AWS · Terraform · Pulumi
Databases: CosmosDB · PostgreSQL · Elasticsearch · TimescaleDB
AI / LLM: RAG pipelines · Azure OpenAI · Anthropic API · Vector Search
Protocols & APIs: gRPC · REST · GraphQL

📚 Books

Ultimate CKAD Certification Guide — OrangeAva
Modern API Design with gRPC — OrangeAva

🎤 Talks & Features

🤖 AI Work

Semantic Search (RAG) — CosmosDB hybrid vector search + Azure OpenAI over petabytes of M365 backup data; natural language → metadata filters via few-shot Chat Completions
Elastic Dashboard Changelog — Python + Anthropic API tool that diffs unreadable .ndjson Kibana files and generates human-readable changelogs
Security Fix Automation — LLM-assisted local skill that ingests Cycode findings and applies targeted fixes with full code context
Blog Generator — AI-powered workflow (Claude / OpenAI) to draft posts from structured idea files
AI Chat Assistant — RAG conversational assistant on my blog site (TF-IDF + Netlify Functions + GPT-4o-mini)

💬 Try the AI Assistant

My blog has a built-in AI chat assistant. Ask it about my posts, projects, or background — it retrieves relevant content and answers using GPT-4o-mini.

👉 Chat at hiteshpattanayak.info

🧠 Currently Thinking About

I'm diving deeper into optimizing retrieval-augmented generation (RAG) pipelines, particularly focusing on refining the re-ranking mechanisms to improve answer accuracy. I'm exploring the use of hybrid vector databases that combine approximate nearest neighbor (ANN) search with traditional indexing techniques to enhance performance in handling complex queries. Additionally, I'm experimenting with chunking strategies to efficiently store and access knowledge based on page-aware contexts in AI chatbot applications.

_{Powered by Claude via scheduled GitHub Actions · view workflow}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hitesh Pattanayak HiteshRepo

Achievements