Use of An Electronic Corpus
When consulting an electronic corpus to make an assessment of word usage it is important to understand what you are dealing with. The Contemporary American Corpus, for example, does ‘exactly what it says on the tin’. It provides a cross section of contemporary spoken and written American English usage from a variety of sources. In addition to accepted use, this may include slang, misuse and simply lazy or sloppy use. Further, it is not international, and it certainly does not, in any way, attempt to define standard accepted use. In turn, the reliability of that body as an indicator of standard use will depend both on the source of its material, and also on the editorial attitude of the associated publishers. For example, a ‘pop’ magazine might include far more slang than say The New York Times. Other online facilities may attempt to cover international English usage and include data from a much wider spectrum of source materials. An example of this would be ‘Cobuild’.
For the student, presumably anxious to learn Standard International English (since that is what they will normally be tested on), this presents a problem: how should one interpret the results? A word such as ‘crore’ for example might only appear a couple of times if at all. Many native English speakers will never have heard of it. Yet in some parts of the world it is in regular daily use by millions of people! There is no question that this word is acceptable, it simply has a particular geographic origin and location.
At this point we might look once again at the phrase ‘during midday’, since this is a good example of a construction which clearly appears, at first sight, to be in regular use. Does its current use qualify it as ‘Standard English’? Does it matter? Who does it matter to? Well, for most people it doesn’t matter at all: we all know what the user means (albeit mainly from the context). It matters to the student who has an exam to pass, and it matters to the examiner. Bear in mind that in an exam it would only be seriously challenged in the event of a boundary mark or a border line fail - so not very often.
Having said that, there is another scenario where there might be a dispute: in a ‘missing word match’ question, e.g. “Underline the most appropriate missing word in the following sentence: Otters can often be seen swimming ____ midday. (while, in, during, at) etc.” A marker might flag up the fact that ‘during’ is regularly being selected and wonders if it should be allowed. (This will happen increasingly as automated marking systems are deployed, since they will be programmed to throw up such anomalies.) This usage seems not to appear in the grammar books, so the chief examiner is called on to make a decision. Note: The examiner has no wish to catch the students out, if there is reasonable doubt, the candidate would normally be awarded the mark.
There is no hard and fast rule. In the example above, the search throws up twenty or so possible instances of use, an average of around one per year over the short period analysed. With a thousand-year-old word as common as ‘midday’, and in general use across all groups of society, we would expect to see several hundred citations from all types of publication, including Washington Post, New Yorker, various learned journals, etc., as we would if we searched on ‘at midday’. What we actually find is that its use appears predominantly limited to a handful of magazine articles almost exclusively devoted to hunting and fishing, indeed several of these articles appear to have been written by the same hand. The examiner might decide this was actually telling us more about the source than confirming the structure’s place in ‘Standard English’. Could it be that it indicates sloppy usage, perhaps a lazy euphemism for ‘the middle of the day’? The editor of an outdoor magazine is unlikely to be very concerned either way.
So, what should we teach the student? The safest explanation is probably that the accepted meaning of ‘midday’ is ‘a point in time’ so ‘at midday’ is correct, but that it is also occasionally used to mean ‘a period of time around the middle of the day’. Will it become part of Standard International English? By definition, it cannot simultaneously mean both ‘a point in time’ and ‘a period of time’. In order to do so its meaning has to become ambiguous. Is there a general need for this additional meaning, which would eventually drive it into standard use? Only time will tell.