Head's Up! These forums are read-only. All users and content have migrated. Please join us at community.neo4j.com.
01-31-2022 05:25 AM
Hi, I am working on my master thesis and I have a few questions related to performance and effectiveness. My database consists of data from an identity management system, where basically users and their roles are stored.
So, my task is to add new information about when was the role used. I need to store all timestamps when a specific user accessed some resource using that role.
Despite I have just a small amount of data, I later need to test my solution on a big dataset and use some graph algorithms which will be traversing the whole graph.
My question is which option can be more effective and provide more performance for graph traversal?
02-01-2022 04:40 AM
The short answer is that you're likely going to be better off with modeling each access as its own node (and not a set of properties on a relationship) because later you may want to assert properties about the access itself, and you'll want to take advantage of indexing.
So instead of:
(:Role)<-[:HAS]-(:User)-[:ACCESSED]->(:Resource)
You might consider
(:Role)<-[:HAS]-(:User)-[:access]->(:Access { date, IP, etc }]-[:resource]->(:Resource)
The long answer on why and what the tradeoffs are you can find here: Graph Data Modeling: All About Relationships | by David Allen | Neo4j Developer Blog | Medium
All the sessions of the conference are now available online