Estimating Selection Result Size (cont)
Estimating size of result for e.g.
select * from Enrolment where year > 2015;
|
Could estimate by using:
- uniform distribution assumption, r, min/max years
Assume: min(year)=2010, max(year)=2019, |Enrolment|=105
- 105 from 2010-2019 means approx 10000 enrolments/year
- this suggests 40000 enrolments since 2016
Heuristic used by some systems: | σA>c(R) | ≅ r/3
|