europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html
Preview meta tags from the europepmc.github.io website.
Linked Hostnames
6- 8 links toeuropepmc.github.io
- 6 links toeuropepmc.org
- 4 links togithub.com
- 1 link toen.wikipedia.org
- 1 link tofusejs.io
- 1 link topages.github.com
Thumbnail

Search Engine Appearance
A perfect match: locating plain text in HTML pages
SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
Bing
A perfect match: locating plain text in HTML pages
SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
DuckDuckGo
A perfect match: locating plain text in HTML pages
SciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
General Meta Tags
9- titleA perfect match: locating plain text in HTML pages | Europe PMC Tech Blog
- charsetUTF-8
- generatorJekyll v3.9.0
- authorFrancesco Talo'
- descriptionSciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
Open Graph Meta Tags
6- og:titleA perfect match: locating plain text in HTML pages
og:locale
en_US- og:descriptionSciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles. These terms are identified as annotations by text mining algorithms, developed by a variety of text mining groups. The main challenge for the SciLite tool is locating plain text annotations in HTML pages. The challenges derive from the nature of HTML pages. Below is a list of the major challenges we faced and the solutions adopted to mitigate them. The pages contain HTML tags, obviously. For example, visit this article, and click on the “Gene Function” checkbox, on the right-hand side of the page, to see the sentence highlighted. Figure 1: Annotation containing HTML tags
- og:urlhttps://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html
- og:site_nameEurope PMC Tech Blog
Twitter Meta Tags
7- twitter:cardsummary
- twitter:site@EuropePMC_news
- twitter:creator@francesco
- twitter:titleA perfect match: locating plain text in HTML pages
- twitter:descriptionSciLite is a Europe PMC tool that allows biological terms or relations, such as diseases, chemicals or protein interactions, to be highlighted for readers on abstracts and full text articles.
Link Tags
5- alternatehttps://europepmc.github.io/techblog/feed.xml
- canonicalhttps://europepmc.github.io/techblog/algorithm/2018/07/04/locating-text-html-pages.html
- shortcut icon/techblog/favicon.ico
- stylesheethttps://fonts.googleapis.com/css?family=Open+Sans:400,700
- stylesheet/techblog/assets/css/style.css?v=99af2fde682c7a20c7f4457b97c1b3da781ebef0
Links
21- http://europepmc.org//abstract/MED/28385055
- http://europepmc.org/abstract/AGR/IND605699789
- http://europepmc.org/articles/PMC3558359
- http://fusejs.io
- https://en.wikipedia.org/wiki/Levenshtein_distance