🐛Developer

How to Debug Production Errors Without Losing Your Mind

Production debugging is different from local debugging. The tools and mental models that work in development often fail in production. Here's what works.

PSBy Priya Shah · Senior Software EngineerFebruary 12, 20266 min read

Free to read

A bug in production with real users affected and no local reproduction is one of the most stressful developer experiences. The key difference between developers who handle it well and those who spiral: preparation before the crisis.

The Monitoring Stack You Need

Before you need to debug production, you need visibility into it. At minimum: error tracking (Sentry captures exceptions with context), structured logging (log requests, responses, and key application events as JSON), and basic uptime monitoring (Uptime Robot or similar). These three tools transform debugging from 'I have no idea what happened' to 'here's exactly what happened and when'.

Structured Logging: The Underrated Investment

console.log('user created') is unhelpful in production. console.log(JSON.stringify({ event: 'user_created', userId, email, timestamp, source })) is actionable. Structured logs can be searched, filtered, and correlated across requests. When an error happens, you can pull the full sequence of events that led to it instead of guessing.

Reading Stack Traces

A stack trace shows the call sequence that led to an error. Read it bottom-up: the bottom is where execution started, the top is where it failed. The frames in your own code are the relevant ones — everything from Node.js internals or library internals is usually context, not cause. Find the top frame that's in your code and start there.

The Bisect Approach

If a bug was introduced by a recent deployment and you don't know which change caused it, use git bisect. Mark a known-good commit and the current (broken) commit, and git bisect does a binary search through commits, asking you at each step whether the bug exists. For a 100-commit range, you find the culprit in 7 steps. This is faster and more reliable than reading every commit manually.

Feature Flags as Debug Tools

Feature flags that can be toggled without deploying are invaluable for production debugging. If you can turn a feature off in production without redeploying, you can quickly narrow whether the feature is causing the issue. This is a architectural investment that pays back every time you have a production incident.

On-call habit

When you fix a production bug, add a regression test before closing the ticket. Document what caused it, how you found it, and what the fix was. This log becomes invaluable for future similar issues and for onboarding new engineers to the production environment.

Frequently Asked Questions

What's the best tool for tracking production errors?+

Sentry is the standard for application error tracking — it captures stack traces, user context, request data, and groups similar errors together. The free tier covers small projects. For infrastructure-level monitoring (server metrics, logs aggregation), Datadog and New Relic are the enterprise standards. For most web applications, Sentry for errors and a basic logging service (Logtail, Papertrail) covers the important bases without significant cost.

How do source maps help debug minified JavaScript?+

Minified JavaScript replaces variable names with single letters and removes all whitespace, making stack traces unreadable. Source maps are separate files that map the minified code back to the original source with real variable names and line numbers. When you set up Sentry or similar tools with source maps, errors in production show the original TypeScript or JSX file and line number instead of 'TypeError: c is not a function at bundle.min.js:1:4291'.

How do I reproduce a production bug locally?+

Start by collecting as much context as possible from the error report: exact URL, user agent, cookies/session state, request/response data, and the full stack trace. Then try to reproduce with the same inputs. If it doesn't reproduce locally, check whether the bug is environment-specific (different Node.js version, environment variables, or data that exists in production but not locally). Feature flags and staging environments that mirror production data are valuable here.

What are the most common causes of production-only bugs?+

Environment differences (different environment variables, API endpoints, or configurations), race conditions that only appear under real traffic load, data edge cases in user-generated content that your test data doesn't cover, third-party service timeouts that don't happen in development, and timing-dependent code that behaves differently under production latency. Adding detailed logging for unusual data paths — before problems occur — is the highest-leverage debugging investment.

🔧 Free Tools Used in This Guide

Json Formatter

Priya Shah

Senior Software Engineer · 9+ years experience

Priya has nine years of experience building distributed systems and developer tooling at two B2B SaaS companies. She writes about APIs, JSON/JWT workflows, regex, DevOps, and the small utilities that make debugging faster at 2am.

View all posts by Priya Shah →

Tags:

developerdebuggingproductionmonitoring

Continue Reading

💻

VS Code Extensions That Actually Save Time (2026 Edition)

6 min read