Senior Infrastructure Engineer - Build and DevEx
We are looking for a versatile Senior Infrastructure Engineer who thrives in a fast-paced yet friendly environment and is motivated by the journey of company-building. The ideal candidate is experienced with build tools such as Bazel and with continuous integration and deployment pipelines. We value the resourcefulness and judgment needed to manage the day-to-day activities of the organization and its leadership with relative autonomy. The candidate must be well-organized, proactive, responsive, and flexible, and must enjoy the challenges of supporting a diverse and distributed workforce. The ideal candidate "gets things done, and done right" through consistent, productive interactions with company staff at various levels, as well as with external parties. The candidate handles sensitive information with appropriate care and maintains positive and professional communications even under pressure.
Key Responsibilities
- Maintain and improve the developer experience in our monorepo environment
- Design and implement build pipeline automation for our software builds and releases
- Help improve release processes for our internal tooling and platforms
- Maintain and improve our Bazel and Buildbarn build system
- Collaborate on our internal HPC batch system (Go/Python) used for hardware builds and tests
- Mentor other members of the team in technical and business strategies
- Review designs and code from others
- Be available during on-call periods. The on-call rotation is one week as primary every six weeks.
Qualifications
- Bachelor’s degree in Computer Science, or relevant experience
- 3+ years working in Infrastructure with a focus on build systems
- 5+ years of experience in programming languages such as Go and Python
- Strong communication skills and the ability to work well with others
- Familiarity with different build pipelines (Jenkins, Concourse, Cloud Build, etc.)
- Proficiency with Bazel and Starlark, including the ability to write and debug custom build rules
- Familiarity with remote build execution (RBE) systems for Bazel, such as Buildbarn, and the ability to debug such systems
- Ability to identify key metrics for measuring product success, using tools such as Prometheus and Grafana
- Proficient with Git for version control
Preferred Skills
- Experience with debugging and supporting a Kubernetes cluster, including maintenance on the cluster itself as well as deployment and management of applications on the cluster
- Experience with AWS or GCP cloud environments
- Experience with testing pipelines (integration, e2e, matrix, etc.)
- Experience with Buildbarn as an RBE for Bazel, and an understanding of the Build Event Stream
- Experience with REST and gRPC protocols
- Understanding of microservices and their use cases
- Familiarity with Terraform
About Us
Enfabrica is on a mission to revolutionize AI compute systems and infrastructure at scale through the development of superior-scaling networking silicon and software, which we call the Accelerated Compute Fabric. Founded and led by an executive team assembled from first-class semiconductor and distributed systems/software companies throughout the industry, Enfabrica sets itself apart from other startups with a very strong engineering pedigree, a proven track record of delivering, deploying, and scaling products in data center production environments, and significant investor support for our ambitious journey! With our differentiated approach to solving the I/O bottlenecks in distributed AI and accelerated compute clusters, Enfabrica is unleashing the revolution in next-gen computing fabrics.