Skip to main content
Blog

Engineering

The systems we build, decisions we make, and what we've learned.

Engineering

How We Evaluate LLM Accuracy for Contract Review

LLMs are powerful but inconsistent, and in contract review, small errors can be costly. Here's how Docusign's AI team built a fast, no-code evaluation system to continuously test, measure, and improve the accuracy of our contract review assistant.

Author Allison Hegel
Allison HegelLead Applied Scientist
How We Evaluate LLM Accuracy for Contract Review

Latest posts

Engineering

From Expired Flag to Merged PR: How We Built Vulture to Automate Code Cleanup

Every mature codebase accumulates dead code: expired feature flags, unreachable branches, logic that shipped years ago and never got cleaned up. So we built Vulture to automate cleanup. It's already removed ~150,000 lines of dead code from our legacy eSignature codebase in one year. Here's how we built it.

Author Abhishek Chaturvedi
Abhishek Chaturvedi
Vulture featured image