Head's Up! These forums are read-only. All users and content have migrated. Please join us at community.neo4j.com.
04-08-2021 03:08 PM
I have created a graph of the 2010 WikiLeaks Cablegate cables. There are 251,287 diplomatic cables in this graph.
This graph is relatively unsophisticated at the moment however I am planning to use NLP (Natural Language Processing) to process the text of the cables and create a more sophisticated graph
The files can be downloaded from github GitHub - whoiskieran/Neo4jWikiLeaks
The text of the cables are not included.
Graph Details
Node Types:
Relationships
Cable [IS_TO] -> Locations
Cable [IS_FROM] -> Locations
Cable [IS_TAGGED_WITH] -> Tag
Cable [IS_MENTIONED_IN] -> Cable
To create the graph do the following (these instructions apply to Neo4j Desktop for Windows):
04-09-2021 02:26 AM
Hi @whoiskieran ,
This is fantastic! Thanks for sharing your work. A small favor, could you tag your github repository with 'neo4j' as a topic? We're working on a website which will feature work like yours, leveraging topic tags on github to find them.
Best,
ABK
04-10-2021 01:27 PM
I have added the neo4j tag now.
04-15-2021 05:15 AM
Great work!
I would like to suggest to add a type (LOCALDATETIME
) to isolate for CABLE nodes. I.e. to replace cable_nodes_header.csv with:
originalID:ID,canonicalID,name,textdate,isodate:LOCALDATETIME,OrigClassif,CurClassif,charcount,cabletype,officeorigin,officeaction,:LABEL
BTW: How did you compile this data? Are there URLs you could include to make reference to documents somewhere?
Best, Thorsten
04-15-2021 11:17 AM
Hi
I will update the header file with this suggestion.
In relation to accessing the cables the simplest way is to use the url below
https://search.wikileaks.org/plusd/cables/.html
I have not gotten the Kissenger cables or the carter cables yet.
Thanks for your suggestion.
I have found some other ways of downloading cables.
https://file.wikileaks.org/file/
Also this GitHub repository has the raw text for a lot of the cables but not all as I am noticing.
All the sessions of the conference are now available online