07-30-2022 07:53 PM
Hi everyone,
I'm new to Neo4j, and have been using Spark for ETL.
I have been experimenting with Neo4j Community Edition for the past two weeks. However, once the graph reaches 1 billion nodes with multiple relationships, queries become very slow, especially in streaming applications. Our team is therefore considering using Spark's GraphX for ETL and pushing only the resulting sub-graphs to Neo4j. Basically, we want to use GraphX to handle node creation, Cypher queries, graph algorithms, and ML, and push just the final results/sub-graphs to Neo4j for visualization.
From my online research, there used to be a project called "Mazerunner" that aimed to do what I need. However, the project seems to have stopped in 2015, and its usage seems to have been limited. Neo4j also used to list Mazerunner on its website as one of the recommended APIs for connecting Neo4j and Spark, but that has since been removed from the Neo4j website. It seems that Mazerunner is no longer supported. My questions are:
Thank you so much for your support!