Head's Up! These forums are read-only. All users and content have migrated. Please join us at community.neo4j.com.
01-01-2019 10:17 PM
I am a neo4j beginner. I have some questions about importing data.
When we import billions of data, how to use java to import, what better information? What better way? Must they all be converted to CSV format? If converted to csv, is it necessary to pre-organize and modify it according to the specified format?
Thank you
01-02-2019 04:30 AM
First you should make sure that what would be your data model according to that you would frame queries for data import.
You should mention that what would be your data source and yes you have to make sure that your system must have the configuration to handle billions of records .
01-02-2019 07:41 AM
There are lots of different ways of importing data into Neo4j.
Probably the most common route is to use CSV, so here's a page giving you full details on that:
Broadly, you can either use the LOAD CSV
method or you can use the neo4j-import
tool. If you only have a little data, LOAD CSV is easier. If you have lots and lots of data and high speed import is your highest priority, then you want neo4j-import.
01-02-2019 04:56 PM
ok!!I'll give it a try.
thank you.
01-02-2019 10:14 PM
Fastest way is using neo4j-admin import , this requires formatting all csvs. For billions of nodes its best to use descending ordering if at all possible. See here: https://neo4j.com/docs/operations-manual/current/tools/import/
You can import data in parallel and for approximately 1.8 TB of data I got around 4 hours on a 32 CPU machine with 244 GB ram. You will need a lot of RAM for the import for sure.
All the sessions of the conference are now available online