Job title: Principal Software Engineer – Develop AI software for software developers at US-Based AI Software Company – Remote with Relocation to Silicon Valley, CA – Up to R3.6 Million PA ($200K per annum)
Job type: Permanent
Emp type: Full-time
Salary: USD $200,000.00
Location: Remote - USA Relocation
Job published: 18/07/2024
Job ID: 39423

Job Description

This is an incredible opportunity for a Polyglot Software Engineer to join a cutting-edge US-based AI software development company. You will be at the forefront of developing an innovative AI-powered SRE (Site Reliability Engineering) agent that autonomously diagnoses and resolves production issues

This is a US-based opportunity, with the company providing relocation & visa assistance. It will initiate as a remote opportunity.

The CTC on offer for this Principal Software Engineer role is up to R3,6 million rand per annum ($200K per annum).

THE COMPANY
This US-based software development company is revolutionizing production environments with a first-of-its-kind AI-powered site reliability agent. This agent autonomously triages and identifies root causes for production alerts, prioritizes critical alerts, and executes intelligent runbooks, ensuring high efficiency and minimal manual intervention.

The company’s mission is to create autonomous software that can manage, repair, and optimize itself, freeing engineers to focus on innovation rather than operational toil.

THE ROLE 
As Principal Software Engineer, you will take ownership of the reasoning, learning, and integration of the SRE agent. You will design the technical architecture of the product, develop the Large Language Model (LLM) capabilities, and be one of the key stakeholders delivering the AI SRE agent to the global market.

You will develop the core features of the AI agent including the reasoning engine, memory module, evaluation stack, and simulation environment. You will be responsible for shipping a high-scale agent service capable of managing thousands of remote services and integrations concurrently.

SKILLS
Extensive software development experience (7+ years).
Expert artificial intelligence and machine learning experience.
Proficiency in multiple programming languages (e.g., Python, Rust, C++, Go). 
Experience with high-scale distributed infrastructure (Kubernetes, Kafka, Spark, microservices).
Familiarity with probabilistic systems and generative AI.