Validity vs reliability in research
Validity is whether you’re measuring the right thing — does your instrument actually capture the concept you intend. Reliability is whether you measure it consistently — would you get the same answer on repeat. Validity is accuracy; reliability is consistency. Examiners check both, and confusing them is a classic methods-section slip.
Why a measure can be reliable but not valid (and not the reverse)
A measure can be consistent yet wrong — reliable but invalid (the scale). But a measure cannot be valid without first being reliable: if your instrument gives a different answer each time, it can’t be consistently hitting the true value. So reliability is necessary for validity, but not sufficient. Establish reliability, then validity.
The types you should name
| Validity (right thing) | Reliability (consistent) |
|---|---|
| Face — looks right on the surface | Test–retest — same test, two time points |
| Content — covers the whole concept | Inter-rater — agreement between coders (Cohen’s κ) |
| Construct — measures the underlying construct | Internal consistency — items hang together (Cronbach’s α) |
| Criterion — agrees with a gold standard / predicts (concurrent & predictive) |
In experiments you’ll also meet internal validity (confidence the IV — not a confounder — caused the change) and external validity (how far the result generalizes).
Get the free Research Design toolkit
Validity and reliability checklists, instrument-selection worksheets, and worked examples from Research Design Simplified. We’ll email you the download link.
Frequently asked questions
Validity vs reliability?
Validity = measuring the right thing (accuracy); reliability = measuring it consistently. Both are needed.
Can a measure be reliable but not valid?
Yes (the always-3kg-heavy scale). But it can’t be valid without being reliable.
What are the types of validity?
Face, content, construct, and criterion (concurrent & predictive); plus internal and external validity in experiments.
How do I measure reliability?
Test–retest, inter-rater (Cohen’s κ), and internal consistency (Cronbach’s α) — pick the type that fits your instrument.