Robot Framework for SQL Testing

Nous Research just released Nomos 1, an open-source AI that ranks second on the notoriously brutal Putnam math exam

Nous Research's open-source Nomos 1 AI model scored 87/120 on the notoriously difficult Putnam math competition, ranking ...

GitHub

A composable, pipeline-based testing framework for AI systems and APIs.

Testing AI systems is hard. Responses are non-deterministic, you need to validate tool usage, and semantic meaning matters more than exact text matching.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Nous Research just released Nomos 1, an open-source AI that ranks second on the notoriously brutal Putnam math exam

A composable, pipeline-based testing framework for AI systems and APIs.

Trending now