Beer Scoring Methods

Why Scoring Systems Exist

Scoring transforms subjective experience into comparable data. A systematic scoring method lets you track improvement over time, compare beers fairly, and communicate quality assessments to others. The right system depends on your goals: competition judging, personal tracking, or public reviewing.

BJCP 50-Point System

The gold standard for homebrew and craft beer competitions. Detailed, structured, and designed for written feedback.

Breakdown: Aroma (12), Appearance (3), Flavor (20), Mouthfeel (5), Overall Impression (10). Each section has descriptor checkboxes and space for written comments.

Strengths: Forces systematic evaluation. Produces actionable feedback. Widely understood in the brewing community. Weaknesses: Time-consuming (5-7 minutes per beer). Appearance is under-weighted relative to its diagnostic value. The 50-point scale can feel granular — is there a meaningful difference between a 32 and a 33?

Best for: Competition judging, detailed homebrew feedback, training exercises.

Professional Brewing QC Scales

Commercial breweries use simplified internal scoring for quality control:

9-point hedonic scale — 1 (dislike extremely) to 9 (like extremely). Quick, intuitive, suitable for consumer panels. Pass/fail/hold — binary quality gate. Does the beer meet release specifications? Used for production batches, not comparative evaluation. Descriptive analysis — trained panelists rate individual attributes (bitterness, malt sweetness, hop aroma intensity) on intensity scales rather than quality scales. This separates description from preference.

Consumer Rating Platforms

Untappd (1-5 scale, 0.25 increments)

The most popular beer rating app. 1-5 stars with quarter-star increments yields 17 possible ratings. Strengths: Fast, social, massive database. Weaknesses: Severe positive skew (most ratings cluster around 3.5-4.0). No structured feedback. Ratings are influenced by hype, rarity, and social pressure.

Ratebeer (1-20 subscales)

Rates aroma (1-10), appearance (1-5), flavor (1-10), palate (1-5), and overall (1-20). Total out of 50, normalized to a 0-100 overall score. Strengths: More granular than Untappd. Section-based. Weaknesses: Complex. The overall section's outsized weight (20 points) dominates the total.

BeerAdvocate (1-5 subscales)

Rates look, smell, taste, feel, and overall on 1-5 scales. Weighted to produce an overall score. Community norms and style-adjusted percentile rankings add context.

Building a Personal System

For private tasting journals, design a system that serves your goals:

Simple (5-point): 1 = drain pour, 2 = would not reorder, 3 = decent, 4 = very good, 5 = exceptional. Fast and honest. Moderate (10-point): Half-point increments add nuance without overwhelming. Aligns with common academic grading intuitions. Detailed (20-point): 4 points each for aroma, flavor, mouthfeel, balance, and drinkability. Provides per-section tracking without the full BJCP overhead.

Whichever scale you choose, define your anchor points clearly and use them consistently. An undefined 10-point scale drifts over time.

Avoiding Common Scoring Pitfalls

Novelty bias — unusual beers (barrel-aged, rare ingredients) score higher than technically superior but familiar beers. Consciously separate novelty from quality. Recency bias — the last beer in a flight scores higher or lower based on palate fatigue. Randomize flight order. Style bias — favorite styles get higher scores. Train yourself to recognize technical quality in styles you do not prefer. Halo effect — a great aroma sets expectations that inflate flavor scores. Evaluate each section independently.

Combining Systems

Many experienced tasters use a dual approach: BJCP-style detailed notes for evaluation sessions and homebrew feedback, plus a simple personal 5-point rating for casual tracking. The detailed system trains perception; the simple system builds a usable database of preferences over time.