03-10-2020 11:46 AM
I have a 3-node Kubernetes cluster where I deploy a 3-core Neo4j cluster using the public Neo4j chart: https://github.com/helm/charts/tree/master/stable/neo4j
The 3-core Neo4j cluster comes up correctly when using the DNS discovery type through the default headless service.
What I have been trying to achieve, however, is to have the cluster members discover each other through NodePort services, as a proof of concept that a Neo4j cluster can be formed across two Kubernetes clusters. Each cluster member gets stuck at the stage "Waiting for 3 members. Currently discovered 0 members: {}" and keeps restarting after the timeout. Does anyone see a problem with this approach? Any suggestions for resolving this would be greatly appreciated. Thanks.
Below is what has been created/configured (by the way, I also tried using a different node IP for each initial_discovery_member, and it did not help):
3 NodePort services for discovery:
3 NodePort services for transaction:
3 NodePort services for raft:
And in the core-statefulset.yaml
Below is the log from the core-0 container:
2020-03-10 18:33:20.501+0000 INFO ======== Neo4j 3.4.5 ========
2020-03-10 18:33:20.551+0000 INFO Starting...
2020-03-10 18:33:22.478+0000 INFO Initiating metrics...
2020-03-10 18:33:22.622+0000 INFO My connection info: [
Discovery: listen=0.0.0.0:5000, advertised=192.168.96.9:31010,
Transaction: listen=0.0.0.0:6000, advertised=192.168.96.9:31020,
Raft: listen=0.0.0.0:7000, advertised=192.168.96.9:31030,
Client Connector Addresses: bolt://neo4j-neo4j-core-0.neo4j-neo4j.default.svc.cluster.local:7687,http://neo4j-neo4j-core-0.neo4j-neo4j.default.svc.cluster.local:7474,https://neo4j-neo4j-core-0.neo4j-neo4j.default.svc.cluster.local:7473
]
2020-03-10 18:33:22.623+0000 INFO Discovering cluster with initial members: [192.168.96.9:31010, 192.168.96.9:31011, 192.168.96.9:31012]
2020-03-10 18:33:22.623+0000 INFO Attempting to connect to the other cluster members before continuing...
2020-03-10 18:38:24.996+0000 ERROR Failed to start Neo4j: Starting Neo4j failed: Component 'org.neo4j.server.database.LifecycleManagingDatabase@50825a02' was successfully initialized, but failed to start. Please see the attached cause exception "Failed to join a cluster with members {clusterId=null, bootstrappable=false, coreMembers={}}. Another member should have published a clusterId but none was detected. Please restart the cluster.". Starting Neo4j failed: Component 'org.neo4j.server.database.LifecycleManagingDatabase@50825a02' was successfully initialized, but failed to start. Please see the attached cause exception "Failed to join a cluster with members {clusterId=null, bootstrappable=false, coreMembers={}}. Another member should have published a clusterId but none was detected. Please restart the cluster.".
And here is the description of one of the discovery services:
Name: neo4j-neo4j-discovery-0
Namespace: default
Labels: <none>
Annotations: <none>
Selector: statefulset.kubernetes.io/pod-name=neo4j-neo4j-core-0
Type: NodePort
IP: 10.233.36.132
Port: discovery-0 5000/TCP
TargetPort: 5000/TCP
NodePort: discovery-0 31010/TCP
Endpoints:
Session Affinity: None
External Traffic Policy: Cluster
Events: <none>
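For reference, a Service manifest that would produce a description like the one above might look roughly like this. It is a sketch reconstructed only from the fields shown in the describe output (name, namespace, selector, ports, policies), not the actual manifest that was applied, so anything not visible in that output is an assumption.

```yaml
# Sketch reconstructed from the `describe` output above; not the original manifest.
apiVersion: v1
kind: Service
metadata:
  name: neo4j-neo4j-discovery-0
  namespace: default
spec:
  type: NodePort
  externalTrafficPolicy: Cluster
  sessionAffinity: None
  selector:
    # Targets a single StatefulSet pod by its pod-name label.
    statefulset.kubernetes.io/pod-name: neo4j-neo4j-core-0
  ports:
    - name: discovery-0
      protocol: TCP
      port: 5000        # Service port
      targetPort: 5000  # Discovery listen port inside the container
      nodePort: 31010   # Port advertised in the core-0 log
```

Note that the Endpoints field in the description above is empty. A Service only lists pods that match its selector and are Ready (unless `publishNotReadyAddresses: true` is set on the Service), so that may be worth checking while the core pods are still waiting to form the cluster.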