NovaEval

by Noveum.ai

Advanced AI Model Evaluation Platform

Powered by NovaEval Framework

About NovaEval Platform

NovaEval is an advanced AI model evaluation framework that provides comprehensive benchmarking across multiple models and datasets. This platform allows you to:

  • Compare Multiple Models: Evaluate up to 10 Hugging Face models simultaneously
  • Comprehensive Datasets: Test on 11 evaluation datasets across reasoning, knowledge, math, code, and language tasks
  • Real-time Monitoring: Watch live evaluation progress with detailed request/response logging
  • Multiple Metrics: Assess performance using accuracy, F1-score, BLEU, ROUGE, and Pass@K metrics
  • NovaEval Framework: Powered by the open-source NovaEval evaluation framework for reliable, reproducible results

Models

(0)

Dataset

Config

10 50 1000
0.0 0.7 2.0

Progress

Ready to start NovaEval

Live Logs

(Requests & Responses)
Waiting for NovaEval to start...