Ask an Agronomy Question
How it works
- Type your agriculture or plant science question below
- 2 random AI models will answer your question
- Vote on the most helpful response
Compare Responses
Select the most helpful answer:
THANK YOU FOR VOTING
Your input helps improve AI for agronomy.
AI Model Performance
Total Ratings
0
Model Rankings
| Rank | Model | Arena Score | Total Votes |
|---|
Price vs. Performance
Objective Benchmark
Read-only performance and price data pulled from the external agronomy benchmark repo. View source repo
This tab is different from the Arena leaderboard. The Arena reflects live head-to-head user voting in this app, while the Benchmark shows objective scores from a separate fixed-question agronomy benchmark repository.
Models Benchmarked
0
Top Model
-
Latest Test Date
-
Performance View
Starts focused on the top models so the ranking is readable instead of compressed.
Price Chart View
The price chart defaults to log scale so low-cost models are easier to compare.
Overall Benchmark Performance
Compare the strongest models first, then expand to the full leaderboard if needed.
Price vs. Benchmark Score
Hover or click a point to inspect a model. Filters are usually more useful than free zoom here.
Focused Model
Hover or click a point to inspect a model in detail.
Full Benchmark Results
Includes overall score, price, access type, test date, and category-level results from the benchmark repo.
| Rank | Model | Overall Score | Price | Access | Date Tested | V1 Questions | Community FBN | Crop Mgmt | Nutrient Mgmt | Pest Mgmt | Soil & Water |
|---|
About
What is this?
This is a benchmark for Agronomy AI models. Users ask questions then rank models based on how they perform.
Why do this?
Without benchmarks, it's hard to know if models are improving or which models are better than others. This is a subjective benchmark. For a more objective benchmark, please see: LLM Agronomy Benchmark.
Author
This project is maintained by Bailey Stockdale. Please feel free to reach out with suggestions, ideas, and feedback.