arxiv.org/abs/2108.10341

Preview meta tags from the arxiv.org website.

Search Engine Appearance

Google

https://arxiv.org/abs/2108.10341

Query Embedding Pruning for Dense Retrieval

Recent advances in dense retrieval techniques have offered the promise of being able not just to re-rank documents using contextualised language models such as BERT, but also to use such models to identify documents from the collection in the first place. However, when using dense retrieval approaches that use multiple embedded representations for each query, a large number of documents can be retrieved for each query, hindering the efficiency of the method. Hence, this work is the first to consider efficiency improvements in the context of a dense retrieval approach (namely ColBERT), by pruning query term embeddings that are estimated not to be useful for retrieving relevant documents. Our proposed query embedding pruning reduces the cost of the dense retrieval operation, and also reduces the number of documents that are retrieved and hence need to be fully scored. Experiments conducted on the MSMARCO passage ranking corpus demonstrate that, when reducing the number of query embeddings used from 32 to 3 based on the collection frequency of the corresponding tokens, query embedding pruning results in no statistically significant differences in effectiveness, while reducing the number of documents retrieved by 70%. In terms of mean response time for the end-to-end system, this results in a 2.65x speedup.
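
The pruning step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that the tokens kept are those with the lowest collection frequency (the rarest ones), which the abstract does not state explicitly. The function name `prune_query_tokens` and the toy frequency table are hypothetical and exist only for illustration.

```python
from collections import Counter


def prune_query_tokens(query_tokens, collection_freq, keep=3):
    """Return the positions of the `keep` query tokens to retain.

    Assumption (not confirmed by the abstract): tokens with the lowest
    collection frequency are treated as the most useful and are kept.
    """
    # Rank token positions by ascending collection frequency.
    # Tokens missing from the table count as frequency 0 (rarest);
    # ties are broken by position so the result is deterministic.
    ranked = sorted(
        range(len(query_tokens)),
        key=lambda i: (collection_freq.get(query_tokens[i], 0), i),
    )
    # Return the surviving positions in their original query order.
    return sorted(ranked[:keep])


# Made-up collection frequencies, for illustration only.
collection_freq = Counter(
    {"what": 900, "is": 1500, "the": 5000, "capital": 40, "of": 4000, "peru": 5}
)
tokens = ["what", "is", "the", "capital", "of", "peru"]

kept = prune_query_tokens(tokens, collection_freq, keep=3)
print([tokens[i] for i in kept])
```

In a ColBERT-style pipeline, only the embeddings at the returned positions would then be used for the nearest-neighbour lookup, which is where the reduction in retrieved documents would come from.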

Bing

Query Embedding Pruning for Dense Retrieval

https://arxiv.org/abs/2108.10341

DuckDuckGo

https://arxiv.org/abs/2108.10341

Query Embedding Pruning for Dense Retrieval

General Meta Tags
17
- title
  [2108.10341] Query Embedding Pruning for Dense Retrieval
- title
  open search
- title
  open navigation menu
- title
  contact arXiv
- title
  subscribe to arXiv mailings
Open Graph Meta Tags
10
- og:type
  website
- og:site_name
  arXiv.org
- og:title
  Query Embedding Pruning for Dense Retrieval
- og:url
  https://arxiv.org/abs/2108.10341v1
- og:image
  /static/browse/0.3.4/images/arxiv-logo-fb.png
Twitter Meta Tags
6
- twitter:site
  @arxiv
- twitter:card
  summary
- twitter:title
  Query Embedding Pruning for Dense Retrieval
- twitter:description
  Recent advances in dense retrieval techniques have offered the promise of being able not just to re-rank documents using contextualised language models such as BERT, but also to use such models to...
- twitter:image
  https://static.arxiv.org/icons/twitter/arxiv-logo-twitter-square.png
Link Tags
12
- apple-touch-icon
  /static/browse/0.3.4/images/icons/apple-touch-icon.png
- canonical
  /abs/2108.10341
- icon
  /static/browse/0.3.4/images/icons/favicon-32x32.png
- icon
  /static/browse/0.3.4/images/icons/favicon-16x16.png
- manifest
  /static/browse/0.3.4/images/icons/site.webmanifest
