Site Reliability Engineer - Client Support Services @ Kraken Digital Asset Exchange - Los Angeles, CA

Job Overview

2 months ago

Site Reliability Engineer - Client Support Services

Kraken Digital Asset Exchange - Los Angeles, CA

About Kraken
As one of the largest and most trusted digital asset platforms globally, we are empowering people to experience the life-changing potential of crypto. Trusted by over 8 million consumer and pro traders, institutions, and authorities worldwide - our unique combination of products, services, and global expertise is helping tip the scales towards mass crypto adoption. But we’re only just getting started. We want to be pioneers in crypto and add value to the everyday lives of billions. Now is not the time to sit on the sidelines. Join us to bring crypto to the world.

To ensure Kraken is the right fit for you, please ensure you read Kraken Culture Explainedto find out more about us!

As part of Kraken's Client Support Services (CSS) SRE Team, you will work within a world-class team of engineers building Kraken's infrastructure. As a Site Reliability Engineer, you will be keeping one of the fastest growing companies in the world up and available in a 24/7 environment. You will bring your own technical expertise to monitor and support staging and production environments, build tooling, CI/CD pipelines, deployment specs and generally automate internal processes to empower developers and improve team efficiency.

Responsibilities

  • Monitor and support Staging and Production environments
  • Improve Developer Tooling, help with building Docker images, manage our Continuous Integration (CI) pipelines for automating quality testing
  • Manage releases using Kubernetes and Nomad
  • Implement tooling to keep track of key metrics and generate alerts
  • Collaborate with Dev, QA, and Product teams to support and improve the development and release cycle
  • Develop tools and bots to improve and automate internal processes
  • Support a fully distributed team operating across numerous timezones

Requirements

  • 3+ years experience working in a SRE, DevOps or equivalent experience as a Backend Developer working with Infrastructure
  • 1+ years experience with a programming language (NodeJS, Rust, Golang, or Python)
  • Extensive experience with monitoring tools such as Grafana, Prometheus, Splunk, and ELK
  • Thorough knowledge of Docker and orchestration tools such as Kubernetes or Nomad
  • Ability to configure and maintain different types of proxy services such as Nginx and HAProxy
  • Proficient in Git source version-control
  • Passion for improving process and products
  • Experience configuring Continuous Integration (CI)
  • Ability to thrive while working independently and remotely in a team-based environment
  • Self-starter, ability to context-switch between various projects, codebases and concepts
  • Ability to independently debug problems involving the network and operating system
  • Well-versed in scripting languages, building and administration of Linux
  • Interest in security and a thoughtful and thorough consideration of the security implications of development decisions

Bonus Points

  • Passion for open-source and contributing back to the community
  • Knowledge about Cloudflare Caching, Page Rules and Workers
  • Experience with Hashicorp Vault and its PKI features
  • Experience with Kubernetes for Local development tools such as Tilt
  • Experience with ReactJS and/or NextJS frameworks
  • Experience with Cloud infrastructure
  • Experience benchmarking applications and identifying bottlenecks
  • Experience with Slack, Jira, Google, and/or Gitlab APIs
  • Experience with monitoring / alerting (primarily with Prometheus / Grafana) and knowledge of best practices in the area
  • Experience with distributed systems and technologies (gRPC, Kafka, NoSQL, SQL, Redis, ...)

Job Type: Full-time

Pay: $100,000.00 - $400,000.00 per year

Benefits:

  • Flexible schedule
  • Health insurance
  • Paid time off

Schedule:

  • Monday to Friday

Supplemental pay types:

  • Bonus pay

Work Location: Multiple Locations

Similar Jobs

Site Reliability Engineer - Client Support Services

Kraken Digital Asset Exchange

Los Angeles, CA

You will bring your own technical expertise to monitor and support staging and production environments, build tooling, CI/CD pipelines, deployment specs and…

Senior Site Reliability Engineer - Cryptowatch

Kraken Digital Asset Exchange

Los Angeles, CA

Responsible for the operation, support, and security of production infrastructure. Author automation tools to assist with deployments, logging, monitoring, and…

Site Reliability Engineer - El Segundo, CA

La Jolla Logic

El Segundo, CA

Drive knowledge sharing across development and operations by documenting lessons learned, deployment and incident processes to optimize service reliability.

Site Reliability Engineer

EZ Texting

Los Angeles, CA

Conduct system analysis, configuration management and develops improvements for system software performance, availability and reliability.

Principal DevOps Engineer

Thales Avionics, Inc. (IFE)

Irvine, CA

Thales people architect solutions that enable two-thirds of planes to take off and land safely. Is responsible for the establishment of our release strategy and…

Dev Ops / Site Reliability Engineer - Hybrid

AEG Worldwide

Los Angeles, CA

In the SRE role you will be working directly with Developers, QA, Infrastructure Engineers, Security and Compliance, Account Management and Incident Management…

Dev Ops / Site Reliability Engineer - Hybrid

AXS

Los Angeles, CA

In the SRE role you will be working directly with Developers, QA, Infrastructure Engineers, Security and Compliance, Account Management and Incident Management…

IT Architect III - Site Reliability Engineer (Hybrid Work Schedule)

Inland Empire Health Plans

Rancho Cucamonga, CA

IEHP is on a journey to adopt Production Engineering as a key part of this transformation is to create a state-of-the-art IT production operations process.

Site Reliability Engineer

Arka Technologies

Los Angeles, CA

Manage cloud infrastructure, provide resource allocation, system upgrades, user access control etc. Perform deep dives on complex system issues ranging from…

Site Reliability Engineering Manager - North America

Kraken Digital Asset Exchange

Los Angeles, CA

Kraken is looking for an experienced engineering manager to build and lead various site reliability engineering working groups. Job Types: Full-time, Contract.

Site Reliability Engineering Manager

PlayStation Global

Aliso Viejo, CA

Own the day-to-day health, uptime and reliability of all networks, servers, storage, and ancillary infrastructure to fulfill the mission of unyielding site…

AUTH Services - Site Reliability Engineer

Tek Ninjas

Lancaster, CA

Partners with the Business stakeholders and Product team in an Agile context to capture the business problem, current state, desired future state, objectives,…

Site Reliability Engineer - Rust - Core Backend

Kraken Digital Asset Exchange

Los Angeles, CA

You will bring your own technical expertise to monitor and support staging and production environments, build tooling, CI/CD pipelines, deployment specs and…

Site Reliability Engineer

Accenture

Los Angeles, CA

Experience in calculating system reliability metrics, including RPO, RTO, SLO & SLI. Built tooling to improve reliability of systems, automated remediation of…

Senior Site Reliability Engineer, Americas

Canonical - Jobs

San Bernardino, CA

Our site reliability engineers bring Python software-engineering skills and rigour to the operations domain. A wide range of engineering disciplines and career…

Senior Site Reliability Engineer

Amgen

Thousand Oaks, CA

Maintain application reliability and uptime SLAs throughout the application lifecycle using programmatic self-healing and software automation.

Senior Site Reliability Engineer, Americas

Canonical - Jobs

Los Angeles, CA

Our site reliability engineers bring Python software-engineering skills and rigour to the operations domain. A wide range of engineering disciplines and career…

Site Reliability Engineering Manager - Europe

Kraken Digital Asset Exchange

Los Angeles, CA

Kraken is looking for an experienced engineering manager to build and lead various site reliability engineering working groups. Job Types: Full-time, Contract.

Site Reliability Engineering Manager - APAC

Kraken Digital Asset Exchange

Los Angeles, CA

Kraken is looking for an experienced engineering manager to build and lead various site reliability engineering working groups. Job Types: Full-time, Contract.

Senior Site Reliability Engineer, US - Remote

Amgen

Thousand Oaks, CA

Maintain application reliability and uptime SLAs throughout the application lifecycle using programmatic self-healing and software automation.

Senior Site Reliability Engineer - Frontend Team

Kraken Digital Asset Exchange

Los Angeles, CA

You will bring your own technical expertise to monitor and support staging and production environments, build tooling, CI/CD pipelines, deployment specs and…

Site Reliability Engineer

Nexthink

Los Angeles, CA

Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.

Staff Site Reliability Engineer

Turo

Los Angeles, CA

Help us build the strategy and product roadmap for all site reliability efforts. Participate in on-call rotation and lead initiatives on incident management,…

Linux Admin/DevOps/SW Engineer

Blu Omega

Culver City, CA

Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.