class Word2VecModel extends Serializable with Saveable
- Alphabetic
- By Inheritance
- Word2VecModel
- Saveable
- Serializable
- Serializable
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
-
new
Word2VecModel(model: Map[String, Array[Float]])
- Annotations
- @Since( "1.5.0" )
Value Members
-
def
findSynonyms(vector: Vector, num: Int): Array[(String, Double)]
Find synonyms of the vector representation of a word, possibly including any words in the model vocabulary whose vector representation is the supplied vector.
Find synonyms of the vector representation of a word, possibly including any words in the model vocabulary whose vector representation is the supplied vector.
- vector
vector representation of a word
- num
number of synonyms to find
- returns
array of (word, cosineSimilarity)
- Annotations
- @Since( "1.1.0" )
-
def
findSynonyms(word: String, num: Int): Array[(String, Double)]
Find synonyms of a word; do not include the word itself in results.
Find synonyms of a word; do not include the word itself in results.
- word
a word
- num
number of synonyms to find
- returns
array of (word, cosineSimilarity)
- Annotations
- @Since( "1.1.0" )
-
def
getVectors: Map[String, Array[Float]]
Returns a map of words to their vector representations.
Returns a map of words to their vector representations.
- Annotations
- @Since( "1.2.0" )
-
def
save(sc: SparkContext, path: String): Unit
Save this model to the given path.
Save this model to the given path.
This saves:
- human-readable (JSON) model metadata to path/metadata/
- Parquet formatted data to path/data/
The model may be loaded using
Loader.load
.- sc
Spark context used to save model data.
- path
Path specifying the directory in which to save this model. If the directory already exists, this method throws an exception.
- Definition Classes
- Word2VecModel → Saveable
- Annotations
- @Since( "1.4.0" )
-
def
transform(word: String): Vector
Transforms a word to its vector representation
Transforms a word to its vector representation
- word
a word
- returns
vector representation of word
- Annotations
- @Since( "1.1.0" )