Data Quality

Michael Gleicher (email) and Xiujun Li (email)

Home   ·   Introduction   ·   Search Services   ·   Examples

Status: not open yet!

Introduction

The data is not 100% perfect due to kinds of reasons. Here this page shows some word errors information about the dataset.


Examples                   Medial s   Ι   Misspellings

'Medial s'

Example: beft [url]

Description:
The 'medial s' is raised when the letter 's' is not at the end of a word in the old English books, the OCR misclassfies it as 'f'. E.g. best - beft.

Example: teft [url]

Description:
test - teft

Misspellings

Example: becuase [url]

Description:
because - becuase

Example: milion [url]

Description:
million - milion

Example: adition [url]

Description:
addition - adition