
Backend Engineer - AI-Powered Search & Applications at Jobgether

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Backend Engineer - AI-Powered Search & Applications in India.

This role is ideal for a highly skilled engineer passionate about building scalable backend systems for AI-driven applications. You will design, develop, and optimize microservices and APIs that support large-scale AI products, working closely with ML engineers and data scientists. Your work will ensure fast, cost-efficient, and reliable inference pipelines for LLMs, RAG systems, and multimodal models. Collaborating across global teams, you will deliver robust, production-grade solutions while maintaining high availability, fault tolerance, and operational excellence. This position offers autonomy, technical leadership opportunities, and direct impact on AI products used by millions of users worldwide.

Accountabilities:

- Design, develop, and maintain scalable microservices powering AI-driven search and discovery systems.
- Build backend services and APIs to productionize LLMs, RAG pipelines, and multimodal models.
- Optimize inference pipelines for latency, throughput, and cost efficiency using techniques like batching, caching, and token budgeting.
- Take ownership of end-to-end backend projects, from system design and implementation to deployment and monitoring.
- Collaborate with ML engineers, data scientists, and product managers to translate business requirements into technical solutions.
- Implement rigorous testing, fault-tolerant practices, and clear documentation to keep operations running smoothly.
- Perform performance tuning, root cause analysis, and incident response for production services.
- Mentor junior engineers and contribute to a high-performance, collaborative engineering culture.

Requirements:

- Bachelor’s or Master’s degree in Computer Science or a related field.
- 4–6 years of backend development experience, ideally with AI or large-scale data systems.
- Proficiency in Java, Golang, or Python, with strong coding and system design fundamentals.
- Experience designing and scaling distributed systems at production scale.
- Exposure to LLM inference setups (e.g., vLLM, Hugging Face Inference, Triton).
- Strong debugging, profiling, and performance tuning skills for latency-sensitive applications.
- Knowledge of storage systems, query optimization, and caching strategies.
- Hands-on experience with AWS (preferred), Kafka, and CI/CD pipelines.
- Ability to work autonomously in fast-paced environments and deliver high-impact solutions.
- Passion for mentoring engineers and fostering a collaborative, growth-oriented culture.

Company Location: India.