europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

Preview meta tags from the europepmc.github.io website.

Linked Hostnames

Thumbnail

Search Engine Appearance

Google

https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

A perfect match: locating plain text in HTML pages

SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags

Bing

A perfect match: locating plain text in HTML pages

https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

DuckDuckGo

https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

A perfect match: locating plain text in HTML pages

General Meta Tags
9
- title
  A perfect match: locating plain text in HTML pages | Europe PMC Tech Blog
- charset
  UTF-8
- generator
  Jekyll v3.9.0
- author
  Francesco Talo'
- description
  SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
Open Graph Meta Tags
6
- og:title
  A perfect match: locating plain text in HTML pages
- og:locale
  en_US
- og:description
  SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
- og:url
  https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html
- og:site_name
  Europe PMC Tech Blog
Twitter Meta Tags
7
- twitter:card
  summary
- twitter:site
  @EuropePMC_news
- twitter:creator
  @francesco
- twitter:title
  A perfect match: locating plain text in HTML pages
- twitter:description
  SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles.
Link Tags
5
- alternate
  https://europepmc.github.io/techblog/feed.xml
- canonical
  https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html
- shortcut icon
  /techblog/favicon.ico
- stylesheet
  https://fonts.googleapis.com/css?family=Open+Sans:400,700
- stylesheet
  /techblog/assets/css/style.css?v=99af2fde682c7a20c7f4457b97c1b3da781ebef0

europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

Linked Hostnames

Thumbnail

Search Engine Appearance

Google

A perfect match: locating plain text in HTML pages

Bing

A perfect match: locating plain text in HTML pages

DuckDuckGo

A perfect match: locating plain text in HTML pages

General Meta Tags

Open Graph Meta Tags

Twitter Meta Tags

Link Tags

Links