Member of Technical Staff - LLM Inference at dottxt

We are redirecting you to the source. If you are not redirected in 3 seconds, please click here.

Member of Technical Staff - LLM Inference at dottxt. Remote Location: Remote - USA. At . .txt. , our mission is to make AI reliable. We are the authors of . outlines. and . outlines-core. , both leading open source libraries (+10k ⭐️) for structured generation.. We raised . $11.9 million. , which is fueling the efforts of our global, fully remote team to create software that goes beyond simple conversation.. We support the most popular forms of structured generation through our existing products like . dotjson. and . dotlambda. , and are always working on the next innovation.. Read more about . .txt. and our technology on our . blog. .. The Role. We’re looking for an strong engineers to drive breakthroughs in . LLM inference optimization. with structured generation. If you thrive in fast-paced startup environments and love pushing the boundaries of how AI systems deliver structured results (think backend engineering that . actually. cuts latency, boosts throughput, and reduces resource costs), then we want to hear from you.. About You. Proven experience deploying inference engines. like vLLM, SGLang, or TensorRT. Experience with distributed inference (multi-GPU, single node) and low-latency communication (NCCL).. Hands-on knowledge of NVIDIA GPU architecture . (CUDA, CUDA cores, memory hierarchy). Track record of improving inference performance. (e.g., improving throughput by 20% through kernel optimizations). Background in . LLM MLOps. (e.g., monitoring, scaling, fault tolerance for inference services).. Proficiency in . Python. and familiarity with (or willingness to learn) . Rust. Understanding of containerization. (Docker, Kubernetes) and Linux systems. Why you should join us:. 🚀 Cutting-edge technology. Structured generation is still a nascent technology. Innovation is not the exception, it’s the rule.. 🌐 . Remote first. Work from anywhere in the world. We have a culture of written communication, and favor infrequent organic discussions over regular large team meetings.. 💸 . Competitive compensation and benefits. We pay market rate (adjusting for seed-stage startup) + equity options, offer health and dental insurance, and have a 401k (US Only). We’ll get you a GPU if you don’t have one already.. Location. .txt is a fully remote company. We value the importance of frequent, synchronous communication while recognizing the importance of strong written communication.. Applying. Please provide a . 1-page. resume in English.. Kindly, . do not. apply to . more than one . position at a time.