Content by Mehrnoosh Sameki, Sandeep Atluri, Minsoo Thigpen and Abby Palia (1)

Turn specs into evals for any agent with ASSERT

Mehrnoosh Sameki, Sandeep Atluri, Minsoo Thigpen and Abby Palia introduce ASSERT, an open-source framework that turns natural-language behavior requirements into executable evaluation pipelines for AI models and agents, generating taxonomies, stratified test cases, traces, and scored results that teams can inspect and iterate on.
News

