text_quality.feature.scorer.dictionary

Module Contents

Classes

Dictionary

Abstract class for scorers to compute feature values

TokenDictionary

Abstract class for scorers to compute feature values

HunspellDictionary

Abstract class for scorers to compute feature values

class text_quality.feature.scorer.dictionary.Dictionary(dictionary)[source]

Bases: text_quality.feature.scorer.scorer.Scorer

Abstract class for scorers to compute feature values

abstract _lookup(token: str) bool[source]
score(tokens: List[str]) float[source]

See Nautilus-OCR

class text_quality.feature.scorer.dictionary.TokenDictionary(dictionary)[source]

Bases: Dictionary

Abstract class for scorers to compute feature values

_lookup(token: str) bool[source]
to_file(filepath: pathlib.Path, sort: bool = True, overwrite: bool = False)[source]
classmethod from_file(filepath: pathlib.Path)[source]
class text_quality.feature.scorer.dictionary.HunspellDictionary(dictionary)[source]

Bases: Dictionary

Abstract class for scorers to compute feature values

_lookup(token: str) bool[source]
classmethod from_path(path: pathlib.Path, language: str) HunspellDictionary[source]