Agronomy AI Arena

Ask an Agronomy Question

How it works

Type your agriculture or plant science question below
2 random AI models will answer your question
Vote on the most helpful response

AI Model Performance

LOADING STATISTICS...

Total Ratings

Model Rankings

Rank	Model	Arena Score	Total Votes

Price vs. Performance

Objective Benchmark

Read-only performance and price data pulled from the external agronomy benchmark repo. View source repo

This tab is different from the Arena leaderboard. The Arena reflects live head-to-head user voting in this app, while the Benchmark shows objective scores from a separate fixed-question agronomy benchmark repository.

LOADING BENCHMARK DATA...

Models Benchmarked

Top Model

Latest Test Date

Filter Models

These filters apply to every chart, model picker, and table row.

Access Price Model Name

Overall Benchmark Performance

Choose one score to rank models. The global filters above still apply.

Rank By Show

Price vs. Benchmark Score

Hover or click a point to inspect a model. Filters are usually more useful than free zoom here.

Focused Model

Hover or click a point to inspect a model in detail.

Price Axis

Actual Benchmark Cost vs. Performance

Measured OpenRouter charges and token usage from this exact benchmark run, including reasoning tokens and retries.

X Axis

Historical runs without captured usage are shown as unavailable rather than estimated. Rerunning a model records exact charged cost, prompt/completion/reasoning tokens, caching, and provider usage.

Local Model Efficiency

Open-weight models only. Models above the frontier are the strongest performers at their footprint.

X Axis

Weight-memory estimates exclude KV cache, activations, runtime overhead, and multimodal encoders. MoE active parameters are shown in tooltips when published.

Model Category Profile

Pick a lab and model to see its full per-category breakdown — where it excels and where it struggles.

Lab Model

Full Benchmark Results

Includes overall score, price, access type, test date, and category-level results from the benchmark repo. Reflects the active filters above.

Rank	Model	Overall Score	Price	Access	Date Tested	Disease ID	Pest ID	Weed ID	V1 Questions	Community FBN	Crop Mgmt	Nutrient Mgmt	Pest Mgmt	Soil & Water

Agronomy AI Arena

Ask an Agronomy Question

Compare Responses

THANK YOU FOR VOTING