API

A full explanation will be added soon. In the meantime, here is a table of the Lexos API modules:

| Module | Description |
| --- | --- |
| cluster | A module for cluster analysis. |
| corpus | Manages a corpus of documents. |
| cutter | Splits documents into segments. |
| DTM | Creates a document-term matrix. |
| io | A set of functions for handling input-output processes. |
| language_model | A method of training and packaging language models for use with the Lexos tokenizer or external tools. |
| scrubber | A destructive preprocessor normally used on texts before they are tokenised. |
| tokenizer | A set of functions used to convert texts into tokenised spaCy docs and to manipulate those docs. |
| utils | A set of utility functions shared by multiple modules. |
| visualization | A set of functions for visualising data generated from documents. |