User:Kiril kovachev/BED Project

Project abstract

The aim of this project is to transcribe the contents of the Bulgarian Etymological Dictionary into wikitext so that it can be read more reliably and used as a source of verifiable information. The texts are currently available only as scans, which makes them far more laborious to consult than necessary, since searching for a particular term is impossible.

Means

The project will be carried out using an OCR (optical character recognition) tool, which produces an approximate text transcription of each scanned PNG page. A human editor can then correct any passages that the OCR rendered unclearly or incorrectly.
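The workflow described here (OCR each page image, then hand the draft to a human editor) could be sketched roughly as follows. This is only an illustration under stated assumptions: the page does not name the OCR tool in use, so the engine is passed in as a plain function, and `transcribe_pages` and the page-marker comment format are hypothetical names chosen for this example.

```python
from pathlib import Path
from typing import Callable, Iterable


def transcribe_pages(pages: Iterable[Path], ocr: Callable[[Path], str]) -> str:
    """Run an OCR function over each page image and join the results into one draft.

    `ocr` is an assumption: any callable that takes an image path and
    returns the recognised text. The project's actual OCR tool is not
    specified, so it is deliberately left pluggable.
    """
    sections = []
    for number, page in enumerate(pages, start=1):
        raw_text = ocr(page).strip()
        # An HTML comment (invisible in rendered wikitext) records which scan
        # the text came from, so a human editor can check it against the image.
        sections.append(f"<!-- page {number}: {page.name} -->\n{raw_text}")
    return "\n\n".join(sections)
```

One possible engine, if Tesseract with Bulgarian language data happens to be installed, would be `ocr=lambda p: pytesseract.image_to_string(Image.open(p), lang="bul")` using the `pytesseract` and Pillow packages. Whichever engine is used, the output remains a draft that still needs manual correction.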

PDFs

I have compiled PDF versions of the available volumes and uploaded them to Google Drive for anyone who wishes to view them. The online reader suffers from significant buffering, at least on my end, so reading the PDFs in a local viewer should be considerably more productive.

Tomes
