vsm.model.TfSeq

class vsm.model.TfSeq(corpus=None, context_type=None)

Trains a term-frequency model.

In a term-frequency model, the number of occurrences of a word type in a context is counted for all word types and contexts. Word types correspond to matrix rows and contexts correspond to matrix columns.

See Also:vsm.model.base, vsm.corpus.Corpus, scipy.sparse.coo_matrix

Methods

__init__([corpus, context_type]) Initialize TfSeq.
load(f) Takes a filename or file object and loads it as an npz archive
save(f) Takes a filename or file object and saves self.matrix in an npz archive.
train() Counts word-type occurrences per context and stores the results in
__init__(corpus=None, context_type=None)

Initialize TfSeq.

Parameters:
  • corpus (Corpus) – A Corpus object containing the training data.
  • context_type (string) – A string specifying the type of context over which the model trainer is applied.
static load(f)

Takes a filename or file object and loads it as an npz archive into a BaseModel object.

Parameters:file (str-like or file-like object) – Designates the file to read. If file is a string ending in .gz, the file is first gunzipped. See numpy.load for further details.
Returns:A dictionary storing the data found in file.
See Also:numpy.load()
save(f)

Takes a filename or file object and saves self.matrix in an npz archive.

Parameters:file (str-like or file-like object) – Designates the file to which to save data. See numpy.savez for further details.
Returns:None
See Also:numpy.savez()
train()

Counts word-type occurrences per context and stores the results in self.matrix.

Previous topic

vsm.model.TfMulti

Next topic

vsm.viewer

This Page