Site Reliability Engineer

Logo.gif?ixlib=rails 2.1

Headquarters: New York, NY

The Infrastructure team
The infrastructure team is interested in building strong foundations for the rest of Petal to pave paths of success upon. The tools Infrastructure uses are at the forefront of industry practices and community driven technology.  We dabble in container orchestration and spend our mornings thinking about how to tame distributed applications. We’re passionate about scalability, reliability, and simplicity, but most of all we’re interested in the empowerment of the company as a whole.
The Infrastructure Software Engineer role
Infrastructure engineers will be important agents of cohesion, in which many teams rely upon your foresight and expertise in order for the bigger picture to come together. The position is best for curious, generalist programmers who are deeply familiar with web application infrastructure and love to apply software engineering principles to make their and everyone else’s lives easier.
Here is our current tech stack: Petal, we're looking for people with kindness, positivity, and integrity. You're encouraged to apply even if your experience doesn't precisely match the job description. Your skills and potential will stand out—and set you apart—especially if your career has taken some extraordinary twists and turns. At Petal, we welcome diverse perspectives from people who think rigorously and aren't afraid to challenge assumptions.

Key responsibilities

  • Be responsible for the overall health and performance of Petal’s underlying infrastructure.

  • Participate in the optimization of the entire lifecycle of services - Deployment, Scaling, Monitoring, and Optimization.

  • Know standard security practices and identify any potential infrastructure-specific vulnerabilities.

  • Write code. We want engineers who can automate the deployment, administration, and monitoring of our large-scale Linux environments. We’re a strong believer in writing code to solve mundane problems.

  • Gain deep application-level knowledge of our systems and contribute to their overall design.

  • Work with development teams to enhance, document, and establish processes and generally improve the operability and security of our systems.

  • Improve automation of operational processes (provisioning, replication, deployments, continuous integration)

  • Bring monitoring, alerting, and observability for production and nonproduction issues to the next level

Characteristics of a successful candidate

  • At least 5 years of dev-ops or site reliability engineering experience. Bonus points for experience in a rapidly growing tech startup.

  • Familiarity with open source. We use, learn from, and contribute to many open source products. Familiarity with concepts and principles that are popular throughout open source is a useful skill.

  • Capable programmer. Infrastructure remains nimble (and sane) by putting automation and software at the forefront of everything it does. We’re looking for candidates whose main tools include the ability to think and act from a programmatic mindset. The ability to recognize the need to automate, when duplication has become burdensome, how to keep things simple, and when it's appropriate to write code, embodies some of many traits that lead to success, allow us to scale, and eliminate any growing tech debt.

  • Strong Linux and Networking knowledge. We walk the cloud native walk and as such need to be deeply familiar with how the networking and terminal side of things work for when things seem to stumble.

  • Knowledge of web design architecture and scalability. With Petal’s current rapid expansion we need candidates who are experienced at designing, building, and maintaining the web architectures of the future.

  • Strong self-management, drive, and organization.  Ability to multi-task and project manage in a fast-paced environment is essential.

  • Sharp and critical eye for details. The ability to think holistically and also maintain focus on small intricate details is essential for the high-impact, production work infrastructure does.

  • Problem-solving versatility and resourcefulness.  There will be many new and unexpected problems, and we need someone who can do the required research/networking to propose well thought-out solutions.

  • Outstanding communication skills, verbal and written.

To apply:

Posted about 1 year ago