Path: news.nzbot.com!not-for-mail
From: "credoquaabsurdum" <credoquaabsurdum@yahoo.com>
Newsgroups: alt.languages.english
Subject: Re: used to
Date: 15 Jul 2005 15:21:22 -0700
Organization: http://groups.google.com
Lines: 20
Message-ID: <1121466082.844045.101500@g43g2000cwa.googlegroups.com>
References: <1120427607.208666.126580@g43g2000cwa.googlegroups.com>
<slrndcitiq.rrd.chris@ccserver.keris.net>
<pskjc1hpl89ib7cn6mq3p6rf60lrj372lt@4ax.com>
<slrndclb7q.gu1.chris@ccserver.keris.net>
<1120991765.440768.39790@g44g2000cwa.googlegroups.com>
<slrndd27pr.pm8.chris@ccserver.keris.net>
<1121041347.535908.253920@f14g2000cwb.googlegroups.com>
<3jg4ovFpsiliU1@individual.net>
<1121277453.522891.209910@g44g2000cwa.googlegroups.com>
<3jpvcnFrbhsuU1@individual.net>
NNTP-Posting-Host: 212.205.252.116
Mime-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
X-Trace: posting.google.com 1121466086 15867 127.0.0.1 (15 Jul 2005 22:21:26 GMT)
X-Complaints-To: groups-abuse@google.com
NNTP-Posting-Date: Fri, 15 Jul 2005 22:21:26 +0000 (UTC)
In-Reply-To: <3jpvcnFrbhsuU1@individual.net>
User-Agent: G2/0.2
Complaints-To: groups-abuse@google.com
Injection-Info: g43g2000cwa.googlegroups.com; posting-host=212.205.252.116;
posting-account=GZZa0w0AAADoATjy0fPIWxXr6_YULlMz
Xref: news.nzbot.com alt.languages.english:881
Mike Lyle wrote:
> Hang about. (This is a genuine question, not a smart-arsery.) I'm
> taking "scan" in its restricted OCR-type meaning: you mean they do
> that? I wouldn't dream of getting evidence for a dictionary by that
> means, especially reading a wide variety of print styles.
OK, I'll try not to give you a "smart-arsey" response.
No. Many other organizations do it (or use "human OCR") in building up
corpora and literary collections. The publishing history of these
documents is almost always available when you access them: in major
projects like Project Gutenberg and LION, you can always trust what you
get.
There is a reading program that's been around forever: you can get more
information at askoxford.com.
|
Follow-ups: | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 |
|