Interop Labs
All Remote Jobs > Software Development
2 months ago
AI ML Distributed Systems Engineer
Worldwide
system training software
Share this job
Source: Remote | OK
About the job
We are actively seeking a talented individual to join our team as an AI distributed systems engineer with a keen interest in blockchain technology. This position is integral to a novel project that synergizes AI and blockchain expertise. The individual will play a pivotal role in crafting large-scale AI systems for training and inference within the dynamic intersection of blockchain and AI.
- Design and build efficient compute environments for AI/ML workloads on the intersection of distributed systems/blockchain technologies. Â
- Build and extend AI compute engines to achieve cutting edge performance for specific workload footprints.Â
- Evaluate models and workloads on different hardware and software stacks.Â
- Collaborate with cross-functional teams to design, implement, and optimize the AI engines.
- BSc, MSc or Ph.D. specializing in distributed systems and AI.
- Excellence in Python, Rust, and/or Go programming languages.
- Expertise in building and leveraging large-scale AI systems for training and inference such as Hugging Face, vLLM, CUDA.Â
- In-depth understanding of performance and system bottlenecks in AI applications.
- Knowledge of the performance characteristics of AI workloads and optimization strategies.
- Autonomous, distributed environment with the opportunity to work collaboratively in a diverse team across the world.
- The scope to contribute to high impact work and really make a difference in a decentralized protocol.
- The chance to challenge yourself whilst learning heaps of stuff in the process.
- Unlimited time off throughout the year to rest and recharge.
- Competitive compensation with stock options, experiencing growth from the initial phase.