Incremental Sparse TFIDF Incremental Similarity With Bipartite Graphs

Sarmento Rui Portocarrero, Brazdil Pavel. Arxiv 2018

In this report, we experimented with several concepts regarding text streams analysis. We tested an implementation of Incremental Sparse TF-IDF (IS-TFIDF) and Incremental Cosine Similarity (ICS) with the use of bipartite graphs. We are using bipartite graphs - one type of node are documents, and the other type of nodes are words - to know what documents are affected with a word arrival at the stream (the neighbors of the word in the graph). Thus, with this information, we leverage optimized algorithms used for graph-based applications. The concept is similar to, for example, the use of hash tables or other computer science concepts used for fast access to information in memory.

Similar Work

Supervised Incremental Hashing
Sign-guided Bipartite Graph Hashing For Hamming Space Search
Bipartite Graph Convolutional Hashing For Effective And Efficient Top-n Search In Hamming Space
Towards Effective Top-n Hamming Search Via Bipartite Graph Contrastive Hashing

Awesome Learning to Hash

Incremental Sparse TFIDF Incremental Similarity With Bipartite Graphs

Sarmento Rui Portocarrero, Brazdil Pavel. Arxiv 2018

Similar Work