Industry Word2Vec (Beta)

Pre-trained Industry Word2Vec Models

Overview


The Eventn Word2Vec service provides pre-trained word2vec models for lexical analysis. The service makes it easy to keep a finger on the pulse of the most important information across industries from a single API.

The Eventn platform continuously crawls sites across the web in real time to build large, industry-focused data corpora ready for analysis.

Industries

The following industries are supported by the Eventn Word2Vec service:

Industry Name    Model Name
Automotive       automotive
High-Tech        hightech
Movies           movies
Music            music

Looking for an industry that is not currently supported? Feel free to contact us to get your industry of choice added.

Getting Started

The following is an example of the mostSimilar() method: submitting a word for a given industry returns the most closely related terms.

Test the service by making a GET request passing in the keyword term and industry name:

https://service.eventn.com/{SERVICE_ID}?term=elon_musk&industry=hightech
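The request above can be assembled programmatically. The following is a minimal Python sketch, not an official client: the helper names build_query_url and fetch_raw are hypothetical, the service ID is a placeholder you must supply, and the response is returned as raw text because its format is not specified here. Decoding the body as UTF-8 is also an assumption.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Host taken from the example URL in this section.
BASE_URL = "https://service.eventn.com"


def build_query_url(service_id: str, term: str, industry: str) -> str:
    """Return the GET URL for a term lookup in the given industry model."""
    # urlencode escapes characters that are not safe in a query string.
    query = urlencode({"term": term, "industry": industry})
    return f"{BASE_URL}/{service_id}?{query}"


def fetch_raw(url: str, timeout: float = 10.0) -> str:
    """Send the GET request and return the response body as text.

    Assumes a UTF-8 body; the response schema is not documented here,
    so no parsing is attempted.
    """
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

For example, build_query_url("YOUR_SERVICE_ID", "elon_musk", "hightech") produces a URL of the same shape as the one shown above, which can then be passed to fetch_raw.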

Example response:

API Methods


loadModel(industry)

Loads a specified industry model containing vector representations. See the Industries section for the model name syntax.
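Because loadModel expects the model name rather than the display name, it can help to resolve the name before calling it. The sketch below is illustrative only: resolve_model_name is a hypothetical helper, not part of the service, and the mapping simply mirrors the Industries table.

```python
from typing import Optional

# Display name -> model name, copied from the Industries table.
MODEL_NAMES = {
    "Automotive": "automotive",
    "High-Tech": "hightech",
    "Movies": "movies",
    "Music": "music",
}


def resolve_model_name(industry: str) -> Optional[str]:
    """Return the model name for a display or model name, or None if unsupported."""
    wanted = industry.strip().lower()
    for display, model in MODEL_NAMES.items():
        # Accept either column of the table, ignoring case and outer whitespace.
        if wanted in (display.lower(), model):
            return model
    return None
```

A None result signals an industry that is not currently supported, so the caller can report it before attempting to load a model.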


mostSimilar(phrase, number)

Calculates the cosine similarity between the supplied phrase and the other word vectors in the vocabulary. The phrase is a string that is internally converted to an Array of words, which are combined into a single phrase vector. The method returns the number words with the highest similarity to the supplied phrase. If number is not supplied, the 40 highest-scoring words are returned by default. If none of the words in the phrase appears in the vocabulary, the function returns null. Otherwise, unknown words are dropped from the computation.
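The ranking behaviour described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the service's implementation: the phrase vector is taken to be the sum of the known word vectors, the query words themselves are excluded from the results, and TOY_VOCAB holds made-up two-dimensional vectors purely for demonstration.

```python
import math
from typing import Dict, List, Optional, Tuple

Vector = List[float]

# Made-up 2-D vectors for illustration only; real models are much larger.
TOY_VOCAB: Dict[str, Vector] = {
    "tesla": [1.0, 0.0],
    "spacex": [0.9, 0.1],
    "guitar": [0.0, 1.0],
    "piano": [0.1, 0.9],
}


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between a and b (0.0 if either is a zero vector)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(
    vocab: Dict[str, Vector], phrase: str, number: int = 40
) -> Optional[List[Tuple[str, float]]]:
    """Return up to `number` (word, similarity) pairs, best first, or None."""
    words = phrase.split()
    # Unknown words are dropped from the computation.
    known = [vocab[w] for w in words if w in vocab]
    if not known:
        return None  # no word of the phrase is in the vocabulary
    dims = len(known[0])
    # Assumption: the phrase vector is the element-wise sum of word vectors.
    phrase_vec = [sum(vec[i] for vec in known) for i in range(dims)]
    # Assumption: words from the query itself are not returned.
    scored = [
        (word, cosine_similarity(phrase_vec, vec))
        for word, vec in vocab.items()
        if word not in words
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:number]
```

With the toy vocabulary, most_similar(TOY_VOCAB, "tesla unknownword", 2) ignores the unknown word and ranks "spacex" ahead of "piano", while a phrase made up entirely of unknown words yields None.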


similarity(phrase1, phrase2)


analogy(word, pair, number)


getVector(word)


getVectors(words)


getNearestWord(vector)


getNearestWords(vector, number)