Member of Technical Staff - RL Infrastructure

xAI · Palo Alto, CA

On-site Staff

ObservabilityCI/CDGoEval harnessesTool useMulti-agentOrchestrationPythonGPUPyTorch

ABOUT xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates. ABOUT THE ROLE: xAI is seeking experienced software engineers to create robust data pipelines, comprehensive evaluations for benchmarking LLMs, and automation frameworks to increase the productivity of researchers and engineers. Typical problems you will deal with include the following: We have a new agentic model capability that we’d like to improve. How do we design an efficient and robust environment for the agent to perform actions in? Evaluations and observability are a core part of knowing what we need to improve in our models. What new features can we add into our evaluation framework to ease the workflow of researchers & engineers and increase observability? A new open-source evaluation dataset has been released and researchers would like to track our models performance on it. How should we onboard it into our internal evaluation framework? Datasets have been collected that require complex pre-processing to prepare it for large-scale RL training. How do we standardize our preprocessing pipelines to minimize dataset onboarding time? A researcher on the team has an idea for how to augment a dataset to produce additional training data. How should we go about creating the data augmentation pipeline? RESPONSIBILITIES: Cr

Apply on company site →