Gremlin

Software Development

San Jose, California 12,291 followers

The Reliability Management Platform for high-velocity engineering teams

See jobs Follow

Discover all 64 employees

About us

Gremlin’s Reliability Management Platform enables high-velocity engineering teams to standardize and automate reliability across their organizations without slowing down software delivery. Gremlin's Reliability Score sets the standard for reliability so there's no guesswork, and an automated suite of Reliability Management tools makes it easy to integrate reliability throughout the software lifecycle so there's no slowdown.

Website: http://www.gremlin.com
External link for Gremlin
Industry: Software Development
Company size: 51-200 employees
Headquarters: San Jose, California
Type: Privately Held
Founded: 2016
Specialties: Distributed Systems, Resilience, Failures as a Service, DevOps, and Chaos Engineering

Locations

Primary

55 S Market St

Ste 1205

San Jose, California 95113, US

Get directions
555 Montgomery St

Ste 811

San Francisco, California 94111, US

Get directions

Employees at Gremlin

See all employees

Updates

Gremlin

12,291 followers
13h
Report this post
What does proactive reliability actually look like in practice — and how do you explain it to leadership? Kolton Andrus joined Techstrong TV to discuss exactly that, including why the cost of an outage goes far beyond the revenue floor. There's the engineering cost: the days spent triaging, fixing, communicating, doing post-mortems. And there's the trust cost — especially for organizations where uptime is core to the product. Watch the full interview: https://hubs.la/Q043CL-10

Gremlin Expands Proactive Reliability with Disaster Recovery Testing - Techstrong TV https://techstrong.tv

Like Comment Share
Gremlin

12,291 followers
4d
Report this post
More microservices mean more failure modes — and more places for issues to hide. 🫣 With over 120 microservices powering payments worldwide, Visa Cross-Border Solutions needed to standardize reliability testing without slowing teams down. 📊 They used Gremlin to create custom test suites for each service, then automated them across environments. The result was consistent, scalable resilience testing — and deployments that engineers can trust. 🤝 Learn how Visa Cross-Border Solutions standardized reliability across teams: https://hubs.la/Q03WYpmH0

Creating a Culture of Reliability at Visa Cross-Border Solutions gremlin.com

1 Comment

Like Comment Share
Gremlin

12,291 followers
5d
Report this post
See why Gremlin is the top choice for major retailers at https://hubs.la/Q046Yn6y0
Like Comment Share
Gremlin

12,291 followers
6d
Report this post
Every major outage reminds us just how interconnected modern architectures are. This story isn’t new. It’s a continued risk as architectures grow in an ever-increasing web of dependencies and services. Fortunately, there is something you can do about it. Check out these testing best practices teams should follow to minimize the impact of large-scale outages so they don’t catch you by surprise. ⬇️ https://hubs.ly/Q046XSlm0

How to be prepared for cloud provider outages gremlin.com

Like Comment Share
Gremlin

12,291 followers
1w
Report this post
Resilience is increasingly a governance question, not just an engineering one. Investors, auditors, and regulators are asking what resilience looks like in practice- not just in policy. For scaling companies preparing for IPO, that means S-1 filings that can speak credibly to digital resilience. For public companies, it means 10-K disclosures that reflect real operational risk management. Gremlin's Disaster Recovery Testing produces detailed reports on service performance that are designed to support exactly this kind of accountability- making it easier to demonstrate proactive reliability efforts to the audiences who need to see them. EM360Tech covers the full picture, from how DRT works to how organizations are using it to close the gap between planning and proof. Read the full analysis: https://hubs.la/Q043CKHv0

Disaster Recovery Testing (DRT) by Gremlin | EM360Tech em360tech.com

Like Comment Share
Gremlin

12,291 followers
1w
Report this post
Great to see Gremlin on the list! Thanks for the shoutout, CloudZero! https://lnkd.in/etMr7Tp6

The 55 Best DevOps Tools In 2026: The Definitive List cloudzero.com

Like Comment Share
Gremlin

12,291 followers
1w
Report this post
2025 saw 15,000+ outages across the internet. Is your system prepared for the next major incident? Get started at https://hubs.la/Q046YfSf0

Disaster Recovery Testing gremlin.com

Like Comment Share
Gremlin

12,291 followers
1w
Report this post
Most organizations know they should be running large-scale disaster recovery tests. They also know it's not practical to run them the way they've traditionally been done. Here's what changes with Gremlin's Disaster Recovery Testing: ➡ Select the services you want to test across your entire organization- not just one team's slice of the stack ➡ Choose from pre-built Scenarios for zone redundancy, region evacuation, DNS redundancy, and more- or bring your own ➡ Disaster Recovery Health Checks automatically halt and revert the test if key metrics go outside your SLA ➡ After the test, get a full report: which services passed, which failed, which teams own them, and what to fix first That's what repeatable DR readiness looks like. See how it works: https://hubs.la/Q043CrZJ0

How to run a Disaster Recovery Test gremlin.com

Like Comment Share
Gremlin reposted this
Kolton Andrus
1w
Report this post
I've been knee deep in Reliability and Chaos Engineering for the past 17 years. How has it evolved during that time? How is it evolved as the software industry has been turned on its head over the past 18 months? I wrote up this piece to share my thoughts, and I'd love to hear your opinions (tell me where I'm wrong!). https://lnkd.in/gXTb4aPQ

The evolution of chaos engineering: From Chaos Monkey at Netflix to reliability management in the AI era ciodive.com

4 Comments

Like Comment Share
Gremlin

12,291 followers
1w
Report this post
With teams shipping AI-assisted code faster than ever, 2026 is shaping up to be the year reliability becomes harder to ignore. More code velocity means more surface area for failure. More AI-driven products built on cloud infrastructure means more dependency on uptime that teams haven't fully pressure-tested. And when a major cloud region goes down — and it will — the question isn't whether your systems will be affected. It's whether you've already proven they can recover. TechBullioncovered Gremlin's new Disaster Recovery Testing with this in mind: in an AI-shaped world, availability isn't just a technical goal. It's what trust is built on. Read more: https://hubs.la/Q043CG070

Gremlin Launches Disaster Recovery Testing, Helping Businesses Avoid Major Cloud Outages https://techbullion.com

Like Comment Share

Browse jobs

Funding

Gremlin 3 total rounds

Last Round

Series B Oct 28, 2018

US$ 18.0M

Investors

Redpoint

See more info on crunchbase

Gremlin

Software Development

San Jose, California 12,291 followers

The Reliability Management Platform for high-velocity engineering teams

About us

Locations

Employees at Gremlin

Jason Heller

Stefano Pirovano

Kolton Andrus

Jason Day

Updates

Join now to see what you are missing

Similar pages

Evervault

Draftwise

Feathery

AttackIQ

Valdera

BrightHire

Pepper

Science Exchange

Vouch

Duna

Browse jobs

Engineer jobs

Account Executive jobs

Manager jobs

Developer jobs

Access Manager jobs

Analyst jobs

Director Business Intelligence jobs

Accounts Receivable Accountant jobs

Revenue Accounting Manager jobs

Director jobs

Performance Test Engineer jobs

Cyber Security Specialist jobs

Quality Assurance Automation Engineer jobs

Vice President jobs

Operations Manager jobs

Senior Manager jobs

Financial Controller jobs

Accounting Manager jobs

Director of Marketing Operations jobs

Controller jobs

Funding