Head's Up! These forums are read-only. All users and content have migrated. Please join us at community.neo4j.com.
08-15-2022 06:43 AM
I am currently looking into using gdsl for a node classification task in which context i have three questions.
1. Is it possible to use relationships instead of properties to train a model ?
2. Is it only possible to use numbers to train a model or can strings be used ?
(regarding those two questions: I would have to use keywords which are each represented as a separate node which are connected to another node based on a relationship. If it is not possible to use relationships and strings a would probably have to create properties for them and represent them as a number )
3. Is multi-class classification possible ? Because a node could be classified as more than one class so more than one relationship would have to be created. (Example: Person could be overweight and sick in any other way at the same time)
08-15-2022 07:44 AM
If you use strings you need to categorize them e.g. via one-hot-encoding into numbers.
https://neo4j.com/docs/graph-data-science/current/algorithms/auxiliary/
How would you want to use relationships? Tomaz had some examples for graph feature extraction that turn relationship data into attributes on the projected graph
https://bratanic-tomaz.medium.com/
Afaik you can use relationship-weight properties for your training.
Not sure about multi-classification
08-18-2022 12:56 AM
Thanks for your answer, I looked into One Hot Encoding and it could be helpful but an alpha state is probably not the best solution for production later on. But as mentioned further down in the comments there are also node embeddings.
Towards the relationships, basically I have keywords and one idea was to represent them each as an node so there would be relationships between them and the node the keywords belong to. Primarily based on those keywords i have to do a node classification.
But I am slowly coming to the conclusion that it would be better to use python for that matter (or Java if possible)
08-15-2022 11:59 PM
3. Unfortunately we do not support multi-label classification at this time. But it's a very interesting feature request and I will add it as a candidate to our internal product roadmap.
08-18-2022 12:57 AM
Thanks for the answer, now i know for sure.
08-16-2022 12:14 AM
To use relationships for your node classification, I would suggest to look into Node embeddings (https://neo4j.com/docs/graph-data-science/current/machine-learning/node-embeddings/).
08-18-2022 12:59 AM
Thanks for the answer. I already seen that in the X-Men example regarding node classification and i will look further into it.
All the sessions of the conference are now available online