Description
We are on the lookout for a Principal Software Engineer I to join our Elasticsearch - Distributed Systems team and focus on how Elasticsearch provides scale, performance, and resilience. This team owns how nodes in an Elasticsearch cluster communicate, and how data are indexed, allocated, and replicated across nodes.
Improving Elasticsearch’s components that support concurrent and consistent indexing across multiple machines.
Maintaining our cluster coordination system to keep performance high even though nodes come and go from the cluster and data moves around, while maintaining the safety and liveness properties of the system as a whole.
Pushing the limits on the number of shards, nodes, and petabytes that Elasticsearch can handle today
Looking into all kinds of issues, including performance or concurrency issues, and proposing solutions.
Supporting our support engineers with the harder problems.
We are looking for someone with a strong background in distributed systems and consensus algorithms. You should have strong skills in core Java and be conversant in the standard library of data structures and concurrency constructs, as well as newer language features. You should also have a deep technical proficiency in algorithms and the ability to understand and work on complex, highly distributed systems.
Additional responsibilities include owning projects from beginning to end, which covers both technical design and working with others to develop needed components. You should also have experience managing projects involving multiple engineers.
Bonus points for experience with data stores, search, analytics, Lucene, and asynchronous event-driven network frameworks such as Netty.