Head's Up! These forums are read-only. All users and content have migrated. Please join us at community.neo4j.com.
11-04-2019 09:53 AM
I would like to dynamically create some product information from scraping a version table.
I'd like to create (:Esxiversion) with .version .name .releasedate .buildnumber .installerbuild from
https://kb.vmware.com/s/article/2143832
I'm not exactly sure how to use the tags to consume the table data within the HTML. any hints to get started would be helpful.
With "https://kb.vmware.com/s/article/2143832" as url
call apoc.load.html(url) yield value
return value
12-12-2019 12:04 PM
Hi, @pdrangeid
I believe that you cant fetch information from that site. As it uses Content Security Policy (CSP) that protects this site content.
If you query
WITH "https://kb.vmware.com/s/article/2143832/" as url
CALL apoc.load.html(url,{target: 'meta'}) YIELD value
RETURN value
you see in response that
"http-equiv": "Content-Security-Policy"
01-03-2020 03:31 AM
Consulted with andrea.larus on neo4j-ninjas slack channel - and he explained that the real issue seems to be that page is generated by javascript runtime.
if you put the following link in Chrome URL bar view-source:https://kb.vmware.com/s/article/2143832
then there is no table tag at all.
Created issue post in APOC repo: https://github.com/neo4j-contrib/neo4j-apoc-procedures/issues/1372
01-03-2020 08:35 AM
Paul,
Thanks for the follow-up. I was looking at the page content with a colleague and had determined that it was dynamically delivered content, but wasn't sure how that affected the ability to collect it with apoc.
Thanks for submitting the issue!
All the sessions of the conference are now available online