Head's Up! These forums are read-only. All users and content have migrated. Please join us at community.neo4j.com.
07-23-2021 09:23 AM
Hi there
I am facing a problem in importing CSV files using neo4j-admin. I have the following nodes and relationship files.
#Nodes
-Node 1 header
addID:ID(Address),address:string,userID:IGNORE
-Node 1 csv
0,1A1zP1eP5QGefi2DMPTfTL5SLmv7DivfNa
1,12c6DSiU4Rq3P4ZxziKxzrL5LmMBrzjrJX
2,1HLoD9E4SDFFPDiYfNYnkBLQ85Y51J3Zb1
3,1FvzCLoTPGANNjWoUo6jUGuAG3wg1w4YjR
4,15ubicBBWFnvoZLT7GiU2qxjRaKJPdkDMG
5,1JfbZRwdDHKZmuiZgYArJZhcuuzuw2HuMu
-Node 2 header
txID:ID(Transaction),txhash,n_inputs,n_outputs
-Node 1 csv
0,4a5e1e4baab89f3a32518a88c31bc87f618f76673e2cc77ab2127b7afdeda33b,0,1
1,0e3e2357e806b6cdb1f70b54c3a3a17b6714ee1f0e68bebb44a74b1efd512098,0,1
2,9b0fc92260312ce44e74ef369f5c66bbb85848f2eddd5a7a1cde251e54ccfdd5,0,1
3,999e1c837c76a1b7fbb7e57baf87b309960f5ffefbf2a9b95dd890602272f644,0,1
4,df2b060fa2e5e9c8ed5eaf6a45c13753ec8c63282b2688322eba40cd98ea067a,0,1
5,63522845d294ee9b0188ae5cac91bf389a0c3723f084ca1025e7d9cdfe481ce1,0,1
#Relationships
-Relationship csv 1
504,367,5000000000
504,192,100000000
504,255,1000000000
533,293,5000000000
555,533,2500000000
--Relationship csv 2
0,0,5000000000
1,1,5000000000
2,2,5000000000
3,3,5000000000
4,4,5000000000
5,5,5000000000
-Here is the script I am using to import the data
bin\neo4j-admin import --nodes=Address="import/addresses-header.csv,import/addresses." --nodes=Txhash="import/txhash-header.csv,import/txhash." --relationships=TO="import/incoming-header.csv,import/txin.* " --relationships=SENDING="import/outcoming-header.csv,import/txout.*" --skip-duplicate-nodes --high-io=true --skip-bad-relationships --delimiter=","
There are more than 400 files containing more than 2 billion lines. The import is done successfully but I notice 700 million of the relationships are missing and logged into the import.report.
I am getting this error when the importing is completed
"There were bad entries which were skipped and logged into import.report file"
and here is the sample of the import.report file
370055140 465945065 398435658 (Address)-[SENDING]->null (global id space) is missing data
So far I could not figure out what the error is meaning or how to resolve it.
Can anyone help me to overcome this problem?
Thank you in advance
07-30-2021 09:40 AM
Hi
Please write your own script to realign your relationships according to domain schema design ( currently I don't have your domain model schema
specification ) constraints ( ID unique and NOT NULL constraint). If there are no such constraints then you can form your relationships yourself.
Thanking you
Sameer S Gijare
All the sessions of the conference are now available online