europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

Preview meta tags from the europepmc.github.io website.

Linked Hostnames

6

Thumbnail

Search Engine Appearance

Google

https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

A perfect match: locating plain text in HTML pages

SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags



Bing

A perfect match: locating plain text in HTML pages

https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags



DuckDuckGo

https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html

A perfect match: locating plain text in HTML pages

SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags

  • General Meta Tags

    9
    • title
      A perfect match: locating plain text in HTML pages | Europe PMC Tech Blog
    • charset
      UTF-8
    • generator
      Jekyll v3.9.0
    • author
      Francesco Talo'
    • description
      SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
  • Open Graph Meta Tags

    6
    • og:title
      A perfect match: locating plain text in HTML pages
    • US country flagog:locale
      en_US
    • og:description
      SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
    • og:url
      https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html
    • og:site_name
      Europe PMC Tech Blog
  • Twitter Meta Tags

    7
    • twitter:card
      summary
    • twitter:site
      @EuropePMC_news
    • twitter:creator
      @francesco
    • twitter:title
      A perfect match: locating plain text in HTML pages
    • twitter:description
      SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles.
  • Link Tags

    5
    • alternate
      https://europepmc.github.io/techblog/feed.xml
    • canonical
      https://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html
    • shortcut icon
      /techblog/favicon.ico
    • stylesheet
      https://fonts.googleapis.com/css?family=Open+Sans:400,700
    • stylesheet
      /techblog/assets/css/style.css?v=99af2fde682c7a20c7f4457b97c1b3da781ebef0

Links

21