vsm.model.TfSeq¶

class vsm.model.TfSeq(corpus=None, context_type=None)¶

Trains a term-frequency model.

In a term-frequency model, the number of occurrences of a word type in a context is counted for all word types and contexts. Word types correspond to matrix rows and contexts correspond to matrix columns.

See Also:	`vsm.model.base`, `vsm.corpus.Corpus`, `scipy.sparse.coo_matrix`

Methods

`__init__`([corpus, context_type])	Initialize TfSeq.
`load`(f)	Takes a filename or file object and loads it as an npz archive
`save`(f)	Takes a filename or file object and saves self.matrix in an npz archive.
`train`()	Counts word-type occurrences per context and stores the results in

__init__(corpus=None, context_type=None)¶

Initialize TfSeq.

Parameters:	corpus (Corpus) – A Corpus object containing the training data. context_type (string) – A string specifying the type of context over which the model trainer is applied.

static load(f)¶

Takes a filename or file object and loads it as an npz archive into a BaseModel object.

Parameters:	file (str-like or file-like object) – Designates the file to read. If file is a string ending in .gz, the file is first gunzipped. See numpy.load for further details.
Returns:	A dictionary storing the data found in file.
See Also:	`numpy.load()`

save(f)¶

Takes a filename or file object and saves self.matrix in an npz archive.

Parameters:	file (str-like or file-like object) – Designates the file to which to save data. See numpy.savez for further details.
Returns:	None
See Also:	`numpy.savez()`

train()¶: Counts word-type occurrences per context and stores the results in self.matrix.

vsm.model.TfSeq¶

Previous topic

Next topic

This Page

Navigation

vsm.model.TfSeq¶

Previous topic

Next topic

This Page

Quick search

Navigation