On 2019-03-19 03:52:46 +0000, symm said:
> Initially I was going to use a 10% tolerance, but the very first test
> has a two-page image with height more than 10% smaller than the median
> height. But width within 5% of double the median width. ¯\_(ツ)_/¯
>
> ...0002.jpg is a wrap-around cover but there's no guarantee that an
> inside two-page spread will not be cropped/resized differently than the
> single page images.
>
> May have to go with more rigorous statistical analysis (iqr/standard
> deviations).
>
> ...0008.jpg is manually manipulated as there were no rotated images,
> but it should be representative.
Rotated pages would have an area (HxW) like the others but each
dimension would be way off, so logic like area < 1 standard deviation
and height > 1 sd. That should differentiate them from two-page spreads
and random smaller or larger images.
|
|