11-01-2022 06:47 PM
I am using 5.1.0 with a single node (Docker), running on 16 vCPUs and 64 GB RAM. I have a query which retrieves up to 5 degrees of connections for a given node. The result has 72K records and the query latency is about 78 seconds. My configuration for the Docker container is as below. Calling dbms.listConfig() lists the same values, so that is consistent.
11-01-2022 07:57 PM
Can you share the query?
11-01-2022 11:34 PM
Thanks glileinfield. The query is as below:
11-02-2022 05:02 AM
@spraja08 have you tried profiling the query to see the plan, page cache hits, etc?
11-02-2022 05:10 PM
Thanks steggy. I have not and will explore that now...
11-07-2022 05:45 AM
Hi Steggy,
I have attached the query profile output. I noticed that the page cache is not used at all. I would expect the ProduceResults operator, at least, to use the cache rather than incurring hundreds of thousands of db hits. It looks like this is where the performance is lost. Could you recommend how to get the engine to use the page cache? Much appreciated.
11-07-2022 10:23 AM
Does it change if you execute the query a second time?
11-07-2022 04:09 PM
Nope. It is always consistent. The db hits count does not change even if the same query is run n times consecutively. I am deeply puzzled...
11-07-2022 05:54 PM - edited 11-07-2022 05:54 PM
Hi @spraja08 ,
Does your graph contain any cycles? How long does the query take when you run only the count of paths? Was this behavior different on previous Neo4j versions? What are you planning to do with these paths, considering the relationships have no type?
Bennu
11-08-2022 08:11 PM
Thanks bennu. The relationships have a property called "type". The graph has cycles, but within a cyclic path the relationship property "type" has different values, so these cycles are valid.
If I only count the paths, the query is very efficient. This suggests that the last operator (ProduceResults) generates a lot of db hits, which contributes to the high latency. I wonder whether this operator is not written to leverage the page cache?!
The paths are the key insight that consumers will use in business scenarios.
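To illustrate the gap between counting paths and returning them, here is a minimal Python sketch. It is not Neo4j code: the toy graph, the edge tuples and the function names are all hypothetical, and it only mimics Cypher's rule that a relationship appears at most once in a matched path. Counting discards each path as soon as it is found, while materializing keeps a full record of every hop, which is loosely analogous to the extra reads done when whole paths are returned to the client.

```python
from typing import Dict, Iterator, List, Tuple

# Hypothetical toy graph: (edge_id, source, target, value of the "type" property).
Edge = Tuple[int, str, str, str]
EDGES: List[Edge] = [
    (0, "a", "b", "owns"),
    (1, "b", "c", "owns"),
    (2, "c", "a", "controls"),  # closes the cycle a -> b -> c -> a
    (3, "a", "c", "advises"),
    (4, "c", "d", "owns"),
]


def outgoing(edges: List[Edge]) -> Dict[str, List[Edge]]:
    """Index edges by their source node."""
    index: Dict[str, List[Edge]] = {}
    for edge in edges:
        index.setdefault(edge[1], []).append(edge)
    return index


def expand(start: str, max_hops: int, edges: List[Edge] = EDGES) -> Iterator[List[Edge]]:
    """Yield every directed path of 1..max_hops hops starting at `start`.

    An edge is never reused within a single path, so cycles are allowed
    but cannot be walked forever.
    """
    index = outgoing(edges)
    stack: List[Tuple[str, List[Edge]]] = [(start, [])]
    while stack:
        node, path = stack.pop()
        for edge in index.get(node, []):
            if edge in path:  # edge already used in this path
                continue
            new_path = path + [edge]
            yield new_path
            if len(new_path) < max_hops:
                stack.append((edge[2], new_path))


def count_paths(start: str, max_hops: int) -> int:
    """Count paths without keeping any of them."""
    return sum(1 for _ in expand(start, max_hops))


def materialize_paths(start: str, max_hops: int) -> List[List[dict]]:
    """Build a full record for every hop of every path."""
    return [
        [{"id": e[0], "from": e[1], "to": e[2], "type": e[3]} for e in path]
        for path in expand(start, max_hops)
    ]


if __name__ == "__main__":
    print(count_paths("a", 5))
    print(sum(len(p) for p in materialize_paths("a", 5)))
```

Even on this five-edge graph the number of hop records grows faster than the number of paths, because each path repeats the hops of its prefix. On a real graph with 72K result paths, the same effect applies to every node and relationship that has to be read for the result rows.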
11-09-2022 12:59 AM
Hi @spraja08,
How are you measuring the execution time? Are you checking the query.log file?
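One way to tell query execution apart from result streaming on the client side is to time the first record and the full consumption separately. The sketch below is a minimal, driver-agnostic illustration: `time_consumption` and `fake_result` are hypothetical names, and `fake_result` merely stands in for whatever lazy result object the client library returns.

```python
import time
from typing import Dict, Iterable, Iterator, Tuple


def time_consumption(records: Iterable) -> Tuple[float, float, int]:
    """Return (seconds to first record, seconds to consume all, record count).

    If the iterable is empty, the first value equals the total.
    """
    start = time.perf_counter()
    first = None
    count = 0
    for _ in records:
        if first is None:
            first = time.perf_counter() - start
        count += 1
    total = time.perf_counter() - start
    return (first if first is not None else total, total, count)


def fake_result(n: int, delay: float = 0.0) -> Iterator[Dict[str, int]]:
    """Hypothetical stand-in for a lazily streamed query result."""
    for i in range(n):
        if delay:
            time.sleep(delay)
        yield {"row": i}


if __name__ == "__main__":
    first, total, count = time_consumption(fake_result(1000))
    print(f"first record after {first:.6f}s, all {count} records after {total:.6f}s")
```

A large gap between the two timings would point at result production and transfer rather than at the traversal itself, which can then be compared with what query.log reports.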