Who we are: Our team supports the Bloomberg Trading Solutions platform, which provides hosted Buyside and Sellside Order Management Systems (OMS) to some of the largest institutional broker-dealers, asset managers, asset owners and hedge funds in the world. Each OMS instance is composed of services and databases that are developed, owned, and managed by hundreds of engineers across the world.
To offer these products to our clients, the Runtime SRE team conceives and manages the hardware and software to manage the highly available distributed architecture to host these OMS's.
Our team strives for:
Reliability to avoid any downtime that can lead to financial consequences for our clients
Observability to ensure our systems are reliable and that issues can be addressed before they become a problem
Automation of systems to ensure the platform is used fairly, as well as to facilitate triaging when issues arise
We'll trust you to:
Own, manage, monitor and optimize the reliability and overall health of our development and production environments
Configure newly allocated clusters and hosts, in addition to streamlining and automating the quality control pipeline
Monitor current capacity, conduct regular capacity testing and predict future capacity needs
Manage the collection and analysis of availability metrics for the management of shared resources (both system and application resources)
Collaborate on future design and implementations of our platform ensuring optimal resource usage while ensuring client isolation
You'll need to have:
4+ years working with an object-oriented programming language (C/C++, Python, Java, etc.)
A Degree in Computer Science, Engineering, Mathematics, similar field of study or equivalent work experience
Prior contributions to system design and architecture and scaling fault-tolerant, distributed systems
An understanding of Computer Science fundamentals such as data structures and algorithms
Strong Unix or Linux fundamentals (or basic knowledge and a strong desire to learn)
Ability to troubleshoot and triage production issues with distributed systems
We'd love to see:
Programming and scripting using Python and any shell variant such as KSH and Bash
Configuration management tools (like Chef, Puppet, Ansible, or Salt)
Containerization and orchestration technologies (like Docker, Kubernetes)
Grafana, Splunk, Humio
You can learn more about how we work and what it means to be an SRE at Bloomberg here: https://www.techatbloomberg.com/blog/bloomberg-bets-big-on-sres/
And here are some conference presentations by our team members: https://www.usenix.org/conference/srecon22emea/presentation/liku https://www.usenix.org/conference/srecon21/presentation/chen https://www.usenix.org/conference/lisa19/presentation/chen https://www.usenix.org/conference/srecon19americas/presentation/chen
Bloomberg is an equal opportunity employer, and we value diversity at our company. We do not discriminate on the basis of age, ancestry, color, gender identity or expression, genetic predisposition or carrier status, marital status, national or ethnic origin, race, religion or belief, sex, sexual orientation, sexual and other reproductive health decisions, parental or caring status, physical or mental disability, pregnancy or parental leave, protected veteran status, status as a victim of domestic violence, or any other classification protected by applicable law.
Bloomberg is a disability inclusive employer. Please let us know if you require any reasonable adjustments to be made for the recruitment process. If you would prefer to discuss this confidentially, please email email@example.com.
Salary Range: 160,000 - 240,000 USD Annually + Benefits + Bonus The referenced salary range is based on the Company's good faith belief at the time of posting. Actual compensation may vary based on factors such as geographic location, work experience, market conditions, education/training and skill level. We offer one of the most comprehensive and generous benefits plans available and offer a range of total rewards that may include merit increases, incentive compensation [Exempt roles only], paid holidays, paid time off, medical, dental, vision, short and long term disability benefits, 401(k) +match, life insurance, and various wellness programs, among others. The Company does not provide benefits directly to contingent workers/contractors and interns.