Snr Site Reliability Engineer - Quizlet (50M active users) - SF, Denver, Remote
Source Coders Inc

  • Company:

  • Technial Recruiting partner:

  • Location: Onsite in San Francisco or Denver or Remote for CST or EST based candidates 

  • Compensation: $120K-$200K (heavily dependent on experience and work location)

  • Work visas accepted: US Citizen, Green Card, H-1B transfer, TN Visa

Quizlet’s mission is to help students (and their teachers) practice and master whatever they are learning. Every month more than 50 million active learners from 130 countries practice and master more than 300 million study sets on every conceivable topic and subject. We are developing new learning experiences by modeling how students learn and drawing upon knowledge acquisition, retention, and pedagogy in cognitive science. We are always seeking to help students master any subject by optimizing study efficiency and engagement. Want to be a go-to person for site reliability on the most-used learning platform in the U.S.? Want to work on a service that is rapidly scaling and relied upon by millions of students and teachers worldwide?  Quizlet is an indispensable utility used daily by millions of students and teachers around the globe. If our site goes down, even just for a few minutes, the pain is felt intensely. Speed is crucial, and downtime is not an option as we grow — during the school year, we are in the top 20 most-visited websites in the U.S. These are challenges you will face on day one at Quizlet.

What you'll do

    • Engage with service owners to improve the entire service lifecycle — from inception and design, through deployment, operation, maintenance, and sunset.

    • Help service owners drive their services through the service lifecycle through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.

    • Help service owners maintain their services once they are live by measuring and monitoring availability, latency, and overall system health.

    • Help scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.

    • Practice and evangelize sustainable incident response and blameless postmortems.

What we are looking for

    • Experience in designing, analyzing and troubleshooting distributed systems serving production traffic.

    • Experience with algorithmic thinking, data structures, and software complexity.

    • Experience in writing scripts in one or more languages such as Python or Go

    • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.

    • Ability and desire to debug and optimize code and automate routine tasks.

    • Experience with on-call duty, know why it’s hard, work to improve it, and make it so well documented that every engineer wants to be on rotation.

    • {Passion|Interest|Experience} with automation of code testing and deployment through the use of containers.

Posted about 1 year ago