Mike Lyle wrote:
> Hang about. (This is a genuine question, not a smart-arsery.) I'm
> taking "scan" in its restricted OCR-type meaning: you mean they do
> that? I wouldn't dream of getting evidence for a dictionary by that
> means, especially reading a wide variety of print styles.
OK, I'll try not to give you a "smart-arsey" response.
No. Many other organizations do it (or use "human OCR") in building up
corpora and literary collections. The publishing history of these
documents is almost always available when you access them: in major
projects like Project Gutenberg and LION, you can always trust what you
get.
There is a reading program that's been around forever: you can get more
information at askoxford.com.