This item has not been updated in the last three editions of the Radar. If it appeared in one of the more recent editions, there is a good chance it remains relevant. If the item is older, however, its relevance may have diminished and our current assessment could differ. Unfortunately, we do not have the capacity to consistently revisit items from past Radar editions.
Assess
Atla Selene is a state-of-the-art LLM judge trained specifically to evaluate generative AI responses. Selene evaluates the outputs of other models across language, coding, math, chat, RAG contexts and more.
You can integrate Selene directly into your codebase using Atla's Python SDK.
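To make the integration concrete, here is a minimal sketch of scoring a single model response with an LLM judge. The `AtlaClient` class and its `evaluate` method are placeholders invented for illustration, not the verified interface of the Atla Python SDK; consult the official SDK documentation for the actual calls.

```python
# Sketch: scoring one model response with an LLM judge.
# AtlaClient and evaluate() are hypothetical placeholders, not the
# verified Atla SDK API; replace them with the real SDK calls.
from dataclasses import dataclass


@dataclass
class Evaluation:
    score: float   # rating assigned by the judge, e.g. on a 1-5 scale
    critique: str  # natural-language justification for the score


class AtlaClient:
    """Placeholder for a client that talks to a hosted Selene judge."""

    def evaluate(self, *, prompt: str, response: str, criteria: str) -> Evaluation:
        # A real implementation would send the prompt, response and
        # criteria to Selene and parse its score and critique. A canned
        # result is returned here so the sketch runs without credentials.
        return Evaluation(score=4.0, critique="Faithful and concise.")


if __name__ == "__main__":
    client = AtlaClient()
    result = client.evaluate(
        prompt="Summarise the refund policy in one sentence.",
        response="Customers can request a refund within 30 days of purchase.",
        criteria="Is the summary faithful to the source document?",
    )
    print(f"score={result.score} critique={result.critique}")
```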
Key Capabilities:
- Automated evaluation of LLM outputs
- Coverage of language, coding, math, chat and RAG evaluation tasks
- Reliable quality assessment for AI applications
- Benchmarking and comparison of different models
- Automated testing of generative AI components
- Integration into existing codebases via the Python SDK
MOHARA should assess the Atla Selene model for projects that require automated evaluation of LLM outputs, particularly when building AI applications that need reliable quality assessment, when benchmarking different models, or when implementing automated testing of generative AI components.
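As a rough illustration of the automated-testing use case, the sketch below gates a test suite on judged quality. The judge_response function is a stand-in for a call to Selene, and the test cases and the 3.5 threshold are illustrative assumptions rather than recommended values.

```python
# Sketch: using LLM-judge scores as a quality gate in an automated test.
# judge_response() stands in for a call to Selene; the cases and the
# 3.5 threshold are illustrative assumptions, not recommended values.
import statistics

CASES = [
    {
        "prompt": "What is the capital of France?",
        "response": "The capital of France is Paris.",
        "criteria": "Is the answer factually correct?",
    },
    {
        "prompt": "Explain recursion in one sentence.",
        "response": "Recursion is when a function calls itself on smaller subproblems.",
        "criteria": "Is the explanation accurate and concise?",
    },
]


def judge_response(prompt: str, response: str, criteria: str) -> float:
    """Placeholder: a real version would return Selene's score for this case."""
    return 4.5


def test_generation_quality() -> None:
    scores = [judge_response(**case) for case in CASES]
    # Fail the build if average judged quality drops below the gate.
    assert statistics.mean(scores) >= 3.5, f"Judged quality too low: {scores}"


if __name__ == "__main__":
    test_generation_quality()
    print("Quality gate passed.")
```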