Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Benedek Rozemberczki

Last update: Jan 7, 2023


Overview

Karate Club is an unsupervised machine learning extension library for NetworkX.

Please look at the Documentation, relevant Paper, Promo Video, and External Resources.

Karate Club consists of state-of-the-art methods for unsupervised learning on graph-structured data. To put it simply, it is a Swiss Army knife for small-scale graph mining research. First, it provides network embedding techniques at the node and graph level. Second, it includes a variety of overlapping and non-overlapping community detection methods. The implemented methods come from a wide range of network science (NetSci, Complenet), data mining (ICDM, CIKM, KDD), artificial intelligence (AAAI, IJCAI) and machine learning (NeurIPS, ICML, ICLR) conferences and workshops, and from prominent journals.

The newly introduced graph classification datasets are available at SNAP, TUD Graph Kernel Datasets, and GraphLearning.io.

Citing

If you find Karate Club and the new datasets useful in your research, please consider citing the following paper:

@inproceedings{karateclub,
       title = {{Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs}},
       author = {Benedek Rozemberczki and Oliver Kiss and Rik Sarkar},
       year = {2020},
       pages = {3125–3132},
       booktitle = {Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM '20)},
       organization = {ACM},
}

A simple example

Karate Club makes the use of modern community detection techniques quite easy (see here for the accompanying tutorial). For example, this is all it takes to use Ego-splitting on a Watts-Strogatz graph:

import networkx as nx
from karateclub import EgoNetSplitter

g = nx.newman_watts_strogatz_graph(1000, 20, 0.05)

splitter = EgoNetSplitter(1.0)

splitter.fit(g)

print(splitter.get_memberships())

Models included

In detail, the following community detection and embedding methods were implemented.

Overlapping Community Detection

DANMF from Ye et al.: Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection (CIKM 2018)
M-NMF from Wang et al.: Community Preserving Network Embedding (AAAI 2017)
Ego-Splitting from Epasto et al.: Ego-splitting Framework: from Non-Overlapping to Overlapping Clusters (KDD 2017)
NNSED from Sun et al.: A Non-negative Symmetric Encoder-Decoder Approach for Community Detection (CIKM 2017)
BigClam from Yang and Leskovec: Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach (WSDM 2013)
SymmNMF from Kuang et al.: Symmetric Nonnegative Matrix Factorization for Graph Clustering (SDM 2012)

Non-Overlapping Community Detection

GEMSEC from Rozemberczki et al.: GEMSEC: Graph Embedding with Self Clustering (ASONAM 2019)
EdMot from Li et al.: EdMot: An Edge Enhancement Approach for Motif-aware Community Detection (KDD 2019)
SCD from Prat-Perez et al.: High Quality, Scalable and Parallel Community Detection for Large Real Graphs (WWW 2014)
Label Propagation from Raghavan et al.: Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks (Physical Review E 2007)

Neighbourhood-Based Node Level Embedding

SocioDim from Tang et al.: Relational Learning via Latent Social Dimensions (KDD 2009)
GLEE from Torres et al.: GLEE: Geometric Laplacian Eigenmap Embedding (Journal of Complex Networks 2020)
BoostNE from Li et al.: Multi-Level Network Embedding with Boosted Low-Rank Matrix Approximation (ASONAM 2019)
NodeSketch from Yang et al.: NodeSketch: Highly-Efficient Graph Embeddings via Recursive Sketching (KDD 2019)
Diff2Vec from Rozemberczki and Sarkar: Fast Sequence Based Embedding with Diffusion Graphs (CompleNet 2018)
NetMF from Qiu et al.: Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and Node2Vec (WSDM 2018)
RandNE from Zhang et al.: Billion-scale Network Embedding with Iterative Random Projection (ICDM 2018)
Walklets from Perozzi et al.: Don't Walk, Skip! Online Learning of Multi-scale Network Embeddings (ASONAM 2017)
HOPE from Ou et al.: Asymmetric Transitivity Preserving Graph Embedding (KDD 2016)
GraRep from Cao et al.: GraRep: Learning Graph Representations with Global Structural Information (CIKM 2015)
DeepWalk from Perozzi et al.: DeepWalk: Online Learning of Social Representations (KDD 2014)
Node2Vec from Grover et al.: node2vec: Scalable Feature Learning for Networks (KDD 2016)
NMF-ADMM from Sun and Févotte: Alternating Direction Method of Multipliers for Non-Negative Matrix Factorization with the Beta-Divergence (ICASSP 2014)
Laplacian Eigenmaps from Belkin and Niyogi: Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering (NIPS 2001)

Structural Node Level Embedding

GraphWave from Donnat et al.: Learning Structural Node Embeddings via Diffusion Wavelets (KDD 2018)
Role2Vec from Ahmed et al.: Learning Role-based Graph Embeddings (IJCAI StarAI 2018)

Attributed Node Level Embedding

FEATHER-N from Rozemberczki et al.: Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models (CIKM 2020)
AE from Rozemberczki et al.: Multi-Scale Attributed Node Embedding (arXiv 2019)
MUSAE from Rozemberczki et al.: Multi-Scale Attributed Node Embedding (arXiv 2019)
FSCNMF from Bandyopadhyay et al.: Fusing Structure and Content via Non-negative Matrix Factorization for Embedding Information Networks (arXiv 2018)
SINE from Zhang et al.: SINE: Scalable Incomplete Network Embedding (ICDM 2018)
BANE from Yang et al.: Binarized Attributed Network Embedding (ICDM 2018)
TENE from Yang et al.: Enhanced Network Embedding with Text Information (ICPR 2018)
ASNE from Liao et al.: Attributed Social Network Embedding (TKDE 2018)
TADW from Yang et al.: Network Representation Learning with Rich Text Information (IJCAI 2015)

Meta Node Embedding

NEU from Yang et al.: Fast Network Embedding Enhancement via High Order Proximity Approximation (IJCAI 2017)

Graph Level Embedding

FEATHER-G from Rozemberczki et al.: Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models (CIKM 2020)
IGE from Galland et al.: Invariant Embedding for Graph Classification (ICML 2019 LRGSD Workshop)
LDP from Cai et al.: A Simple Yet Effective Baseline for Non-Attributed Graph Classification (ICLR 2019)
GeoScattering from Gao et al.: Geometric Scattering for Graph Data Analysis (ICML 2019)
GL2Vec from Chen and Koga: GL2Vec: Graph Embedding Enriched by Line Graphs with Edge Features (ICONIP 2019)
NetLSD from Tsitsulin et al.: NetLSD: Hearing the Shape of a Graph (KDD 2018)
SF from de Lara and Pineau: A Simple Baseline Algorithm for Graph Classification (NeurIPS RRL Workshop 2018)
FGSD from Verma and Zhang: Hunt For The Unique, Stable, Sparse And Fast Feature Learning On Graphs (NeurIPS 2017)
Graph2Vec from Narayanan et al.: Graph2Vec: Learning Distributed Representations of Graphs (MLGWorkshop 2017)

Head over to our documentation to find out more about installation and data handling, a full list of implemented methods, and datasets. For a quick start, check out our examples.

If you notice anything unexpected, please open an issue and let us know. If you are missing a specific method, feel free to open a feature request. We are motivated to constantly make Karate Club even better.

Installation

Karate Club can be installed with the following pip command.

$ pip install karateclub

As we create new releases frequently, upgrading the package occasionally might be beneficial.

$ pip install karateclub --upgrade

Running examples

As part of the documentation we provide a number of use cases to show how the clusterings and embeddings can be utilized for downstream learning. These can be accessed here with detailed explanations.

Besides the case studies we provide synthetic examples for each model. These can be tried out by running the example scripts. For instance, to run the Graph2Vec example:

$ cd examples/whole_graph_embedding/
$ python graph2vec_example.py

Running tests

$ python setup.py test

License

GNU General Public License v3.0

Comments

GL2vec : RuntimeError: you must first build vocabulary before training the model

Hello, First thanks for your work, it's just great.

However, I get an error when trying to run GL2vec on my dataset, while it works perfectly with the example. Where exactly is this type of error coming from?

Thanks in advance

opened by hug0prevoteau 14
How to build my own dataset?
I have to build graphs, and following that I have to generate graph embedding.

I checked the documentation i.e. https://karateclub.readthedocs.io/.

But I didn't understand how to build my own graphs.

Can you please point out a sample code where you create dataset from scratch?

I have already checked code here. But they all load pre-defined dataset.

Can you show any code snippet where you create graph i.e. create nodes and add edges.

How to set attributes (features) for the nodes and edges?

Thanks in advance for your help.

I am following the https://karateclub.readthedocs.io/en/latest/notes/installation.html.
opened by smith-co 9
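
As a starting point for the question above, here is a minimal sketch of building a small attributed graph from scratch with NetworkX. The node count, the attribute key `feature`, and the `weight` values are assumptions chosen for illustration only; which attributes a given Karate Club estimator actually reads should be checked in its documentation.

```python
import networkx as nx

# Build an undirected graph whose nodes are indexed 0..n-1 without gaps.
graph = nx.Graph()
graph.add_nodes_from(range(4))
graph.add_edges_from([(0, 1), (1, 2), (2, 3), (3, 0)])

# Attach a node attribute; the key "feature" is an assumption made for this sketch.
nx.set_node_attributes(graph, {0: "a", 1: "b", 2: "a", 3: "c"}, name="feature")

# Attach an edge attribute in the same way.
nx.set_edge_attributes(
    graph,
    {(0, 1): 0.5, (1, 2): 1.0, (2, 3): 2.0, (3, 0): 0.1},
    name="weight",
)

# Graph level methods are fitted on a list of such graphs.
graphs = [graph]
```

Node level methods would take a single graph like `graph`, while graph level methods would take the `graphs` list.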
Using Feather-Graph with Node Attributes

Hi @benedekrozemberczki,

Thanks for creating and maintaining this awesome toolbox for graph and node level embedding techniques. I've been using Feather-Graph to embed non-attributed graphs and the results have been fantastic.

Question: I'm working on a new problem where graphs contain nodes with attribute information and I wanted to see if it's possible (or makes sense) to extend Feather-Graph to incorporate node attribute information?

Current thought process: I went through the source code and saw that Feather-Node can leverage an attribute matrix, while Feather-Graph uses the log-degree and clustering coefficient as node features. I felt like there could be an opportunity to plug the feature generation process of Feather-Node into Feather-Graph here, but couldn't determine if there would be any downsides to this approach?

I went through your paper "Characteristic Functions on Graphs..." but wasn't able to come to a decision one way or the other. Hoping you can shed some light on it!

Thanks, Scott

opened by safreita1 8
About GL2vec

Hello, thanks for the awesome work!!

It seems that there are 2 mistakes in the implementation of GL2vec module.

The first one is :

In the code below,

def _create_line_graph(self, graph):
    r"""Getting the embedding of graphs.

    Arg types:
        * graph (NetworkX graph) - The graph transformed to be a line graph.

    Return types:
        * line_graph (NetworkX graph) - The line graph of the source graph.
    """
    graph = nx.line_graph(graph)
    node_mapper = {node: i for i, node in enumerate(graph.nodes())}
    edges = [[node_mapper[edge[0]], node_mapper[edge[1]]] for edge in graph.edges()]
    line_graph = nx.from_edgelist(edges)
    return line_graph

when converting a graph G to its line graph LG, the method "_create_line_graph()" ignores the edge attributes of G, so there are no node attributes in LG. Consequently, "WeisfeilerLehmanHashing" cannot use the attribute information and always uses the structural information (degree) instead.

The second one is :

The GL2vec module only returns the embedding of the line graph. But in the original GL2vec paper, the authors concatenate the embedding of the graph and that of the line graph,
and then name the framework "GL2vec", which means "Graph and Line graph to vector".

Using only the embedding of the line graph for downstream tasks may lead to worse performance.

We noticed that when applying the embeddings to the graph classification task (when graphs have both node attributes and edge attributes), the accuracies rank as follows: concat(G, LG) > G > LG

Hope it helps :)

opened by cheezy88 8
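
The two points raised in this issue can be illustrated with a small pure-Python sketch. It is not the library's implementation: `line_graph_with_features` and `concatenate` are hypothetical helpers, and the edge features are made-up labels. The first helper builds a line graph in which every original edge becomes a node that keeps its edge feature; the second joins a graph embedding and a line graph embedding row by row.

```python
from itertools import combinations


def line_graph_with_features(edges):
    """Build a line graph from a dict {(u, v): edge_feature}.

    Each original edge becomes a node of the line graph and keeps its
    edge feature; two such nodes are linked when the original edges
    share an endpoint.
    """
    ordered = sorted(edges)
    # Give every original edge a consecutive integer index.
    index = {edge: i for i, edge in enumerate(ordered)}
    node_features = {index[edge]: feature for edge, feature in edges.items()}
    line_edges = [
        (index[e1], index[e2])
        for e1, e2 in combinations(ordered, 2)
        if set(e1) & set(e2)
    ]
    return node_features, line_edges


def concatenate(graph_embedding, line_graph_embedding):
    """Join the two embeddings of each graph, row by row."""
    return [g + lg for g, lg in zip(graph_embedding, line_graph_embedding)]


# A path 0-1-2 with labelled edges (labels are illustrative).
features, line_edges = line_graph_with_features({(0, 1): "single", (1, 2): "double"})

# One graph, two dimensions per embedding (values are illustrative).
combined = concatenate([[0.1, 0.2]], [[0.3, 0.4]])
```

With node features preserved on the line graph, an attribute-aware hashing step would have something other than degree to work with.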
Classifying with graph2vec model

I'm able to obtain an embedding for a list of NetworkX graphs using graph2vec, and I was wondering if karateclub has a function to make classifications for graphs outside the training set? That is, given my embedding, I want to input a graph outside my original graph list (used in the model) and obtain a list of most similar graphs (something like a "most similar" function).

opened by joseluisfalla 6
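
A "most similar" lookup of the kind asked about above can be sketched outside the library once embeddings are available as rows of numbers. The helper names and the toy vectors below are assumptions for illustration. The sketch also assumes the query vector lives in the same embedding space as the stored rows; how to obtain such a vector for an unseen graph is not covered here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def most_similar(query, embeddings, top_k=3):
    """Return (index, similarity) pairs for the top_k rows closest to query."""
    scored = [(i, cosine_similarity(query, row)) for i, row in enumerate(embeddings)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_k]


# Toy embedding matrix: one row per graph in the original list.
embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Rank the stored graphs against a query vector.
ranking = most_similar([2.0, 0.1], embeddings, top_k=2)
```

The indices in `ranking` refer to positions in the original list of graphs.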
How to improve the performance of Graph2Vec model fit function ?
I tried to improve the performance of the Graph2Vec model by increasing the workers parameter when initializing the model. But it seems that the model still uses only 1 core to process the fit function.

Is the method I have used to assign the workers correct? Is there another way to improve the performance?

model = Graph2Vec(workers=28)
graphs_list = create_graph_list(graph_df)
model.fit(graphs_list)
graph_x = model.get_embedding()
opened by 1209973 6
Is consecutive numeric indexing necessary for Graph2Vec?

Thanks for the awesome work, networkx is truly helpful when we are dealing with Graph data structure.

I'm trying to get graph embedding using Graph2vec so that we could compare similarity among graphs. But I'm stuck in this assertion: assert numeric_indices == node_indices, "The node indexing is wrong."

Say if we have two graphs, each node in the graph represents a word. We build a mapping so that we could replace text with number. For example, whenever the word "Library" occurs in any graph, we label it with the number "2". In this case, the indexes inside one graph might not be consecutive because the mapping is created from a number of graphs.

So is it still necessary to enforce consecutive indexing in this case? Or do I misunderstand the usage of Graph2Vec?

opened by bdeng3 5
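
One way to satisfy the consecutive-indexing assertion while keeping track of what each node originally was is to relabel every graph separately and store the original label alongside. The sketch below does this on plain edge lists; `relabel_consecutively` is a hypothetical helper, not part of Karate Club. For NetworkX graphs, `nx.convert_node_labels_to_integers` with its `label_attribute` argument offers similar behaviour.

```python
def relabel_consecutively(edges, isolated_nodes=()):
    """Map arbitrary node labels to 0..n-1 for a single graph.

    Returns the relabelled edge list and a dict from new index to
    original label, so the original identity can be kept as a feature.
    """
    labels = sorted({node for edge in edges for node in edge} | set(isolated_nodes))
    to_index = {label: i for i, label in enumerate(labels)}
    new_edges = [(to_index[u], to_index[v]) for u, v in edges]
    original_label = {i: label for label, i in to_index.items()}
    return new_edges, original_label


# Node ids come from a vocabulary shared across graphs, so they have gaps.
edges = [(2, 17), (17, 40), (40, 2)]
new_edges, original_label = relabel_consecutively(edges)
```

After relabelling, index 1 in one graph and index 1 in another no longer refer to the same word, so any shared identity has to travel through `original_label` (for example as a node attribute).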
Graph Embeddings using node features and inductivity

Hello,

First of all, thank you for this amazing library! I have a series of small graphs where each node contains features, and I am trying to learn graph-level embeddings in an unsupervised manner. However, I couldn't find how to load node features into the graphs before feeding them to a graph embedding algorithm. Could you describe the input needed by the algorithms?

Also, is it possible to generate embedding with some sort of forward function once the models are trained (without retraining the model) ? I.e. does the library support inductivity ?

Thank you!

opened by TrovatelliT 5
graph2vec implementation and graphs with missing nodes

Hi there,

first of all, thanks a lot for developing this, it has potential to simplify in-silico experiments on biological networks and I am grateful for that!

I have a question related to the graph2vec implementation. The requirement of the package for graph notation is that nodes have to be named with integers starting from 0 and have to be consecutive. I am working with a collection of 9.000 small networks and would like to embed all of them into an N-dimensional space. Now, all those networks consist of about 25.000 nodes but in some networks these nodes (here it's really genes) are missing (not all genes are supposed to be present in all networks).

If I rename all my nodes from actual gene names to integers and know that some networks don't have all the genes, I will end up with some networks without consecutive node names, e.g. there will be (..), 20, 21, 24, 25, (...) in one network and perhaps (...), 20, 21, 22, 24, 25, (...) in another. That would violate the requirement of being consecutive.

My question is: is the implementation aware that node 25 is the same object across the different networks? Or is that not important because the embedding only takes the structure into account, so that I should 'rename' the nodes of each network separately to keep the node naming consecutive?

opened by kajocina 5

Multithreading for WL hashing function

Hi!

Maybe just another suggestion. In the embedding algorithms, the WeisfeilerLehmanHashing call in the fit function can be time-consuming, and the WL hashing of each graph is independent of the others. Therefore, using multithreading from Python may speed it up. I modified the code for my application of graph2vec as follows:

==================================

from multiprocessing.pool import ThreadPool  # import needed by the modified fit()

def fit(self, graphs):
    """
    Fitting a Graph2Vec model.

    Arg types:
        * **graphs** *(List of NetworkX graphs)* - The graphs to be embedded.
    """
    pool = ThreadPool(8)
    args_generator = [(graph, self.wl_iterations, self.attributed) for graph in graphs]
    documents = pool.starmap(WeisfeilerLehmanHashing, args_generator)
    pool.close()
    pool.join()
    #documents = [WeisfeilerLehmanHashing(graph, self.wl_iterations, self.attributed) for graph in graphs]
    documents = [TaggedDocument(words=doc.get_graph_features(), tags=[str(i)]) for i, doc in enumerate(documents)]

    model = Doc2Vec(documents,
                    vector_size=self.dimensions,
                    window=0,
                    min_count=self.min_count,
                    dm=0,
                    sample=self.down_sampling,
                    workers=self.workers,
                    epochs=self.epochs,
                    alpha=self.learning_rate,
                    seed=self.seed)

    self._embedding = [model.docvecs[str(i)] for i, _ in enumerate(documents)]

opened by zslwyuan 5
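
The idea in this suggestion, that each graph can be hashed independently and therefore mapped over a worker pool, can be shown with a self-contained sketch. The `wl_features` function below is a toy Weisfeiler-Lehman relabelling written for illustration; it is an assumption of this sketch and not the library's `WeisfeilerLehmanHashing` class.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def wl_features(adjacency, iterations=2):
    """Toy Weisfeiler-Lehman relabelling on {node: [neighbours]}.

    Starts from node degrees and repeatedly hashes each node's label
    together with the sorted labels of its neighbours.
    """
    labels = {node: str(len(neigh)) for node, neigh in adjacency.items()}
    features = list(labels.values())
    for _ in range(iterations):
        new_labels = {}
        for node, neigh in adjacency.items():
            signature = "_".join([labels[node]] + sorted(labels[n] for n in neigh))
            new_labels[node] = hashlib.md5(signature.encode()).hexdigest()
        labels = new_labels
        features.extend(labels.values())
    return features


graphs = [
    {0: [1], 1: [0, 2], 2: [1]},        # path on three nodes
    {0: [1, 2], 1: [0, 2], 2: [0, 1]},  # triangle
]

# Each graph is hashed independently, so the work can be mapped over a pool.
with ThreadPoolExecutor(max_workers=2) as pool:
    documents = list(pool.map(wl_features, graphs))
```

Because this hashing is CPU-bound Python code, threads share the interpreter lock and the speed-up may be limited; a process pool (for example `concurrent.futures.ProcessPoolExecutor`) is an alternative worth measuring.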

Update requirements
As it stands, setup.py has the following requirements which specify maximum versions:

install_requires = [
    "numpy<1.23.0",
    "networkx<2.7",
    "decorator==4.4.2",
    "pandas<=1.3.5"
]

Is there a reason for the maximum versions, such as expired deprecated features used by karateclub? In my personal research, and in using the included test suite via python3 ./setup.py test, I have not encountered issues in upgrading the packages.

$ pip3 install --upgrade --user networkx numpy pandas decorator
$ pip3 list | grep "networkx\|numpy\|decorator\|pandas"
decorator 5.1.1
networkx 2.8.8
numpy 1.23.5
pandas 1.5.2

Running the tests with these updated packages yields the following:

$ cd karateclub/
$ pytest
...
47 passed, 2540 warnings in 210.58s (0:03:30)

Yes, there are lots of warnings. Many are DeprecationWarnings. The current requirements generate 855 warnings.

$ cd karateclub/
$ pip3 install --user .
$ pytest
...
47 passed, 855 warnings in 225.49s (0:03:45)

I suppose the question is: even with additional instances of DeprecationWarning, can we bump up the maximum requirements for this package? Or would the community feel better addressing the deprecation issues before continuing?

For context, my motivation is to keep this package current; I'm currently held back (not actually, but per the setup requirements) by this package's maximum requirements. Does anyone have any thoughts?
opened by WhatTheFuzz 4
Parallel BigCLAM Gradient Computation

https://github.com/benedekrozemberczki/karateclub/blob/de27e87a92323326b63949eee76c02f8d282adc4/karateclub/community_detection/overlapping/bigclam.py#L111-L115

I've noticed that the gradient calculation in BigCLAM could be easily parallelized. This should be embarrassingly parallel, computing the gradients in batches of n_jobs nodes. The only issue would be when some node in the batch is also the neighbor of another node in the same batch, since self._do_updates would have to wait for the entire batch to run and then update the duplicated node embeddings.

This event may be rare if we consider that the graph is sparse, so such a "collision" is unlikely. Still, if it did happen, I believe it would not be a huge problem, as long as it does not happen again in further iterations (which is even more unlikely).

What do you guys think? I could implement this if you think it would be safe to accept the issue I've mentioned above.

opened by AlanGanem 2
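
The "collision" described in this proposal, two neighbouring nodes landing in the same batch, could also be avoided up front rather than tolerated. The sketch below greedily forms batches in which no two nodes are adjacent. It is a hypothetical helper for illustration, not code from BigCLAM or Karate Club, and it assumes the adjacency sets contain no self-loops.

```python
def collision_free_batches(nodes, adjacency, batch_size):
    """Greedily group nodes so that no batch contains two neighbours.

    A node is deferred to a later batch when the current batch is full
    or when it is adjacent to a node already placed in that batch.
    """
    remaining = list(nodes)
    batches = []
    while remaining:
        batch, blocked, deferred = [], set(), []
        for node in remaining:
            if len(batch) < batch_size and node not in blocked:
                batch.append(node)
                blocked.update(adjacency[node])
            else:
                deferred.append(node)
        batches.append(batch)
        remaining = deferred
    return batches


# Path graph 0-1-2-3.
adjacency = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2}}
batches = collision_free_batches([0, 1, 2, 3], adjacency, batch_size=2)
```

On the path graph this yields the batches `[0, 2]` and `[1, 3]`, so the gradients within a batch never touch each other's embeddings. The trade-off is that dense neighbourhoods produce smaller batches.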
Randomness in Laplacian Eigenmaps Embeddings
Hi! I'm using Laplacian Eigenmaps and noticed that the resulting embeddings are not always the same, even though I have explicitly set the seed:

model = LaplacianEigenmaps(dimensions=3,seed=0)

Running the same algorithm multiple times in the same Python session yields different embeddings each time. Here is a minimal reproducible example:

import networkx as nx
import numpy as np
from karateclub.node_embedding.neighbourhood import LaplacianEigenmaps

g_undirected = nx.newman_watts_strogatz_graph(1000, 20, 0.05, seed=1)

for _ in range(5):
    model = LaplacianEigenmaps(dimensions=3, seed=0)
    model.fit(g_undirected)
    node_emb_le = model.get_embedding()
    print(np.sum(node_emb_le))

It yields the following summed values of the embeddings for me:

31.647046936812927
-31.647046936812888
31.64704693681287
-31.690999529775908
-31.581837545720354

How can I control the randomness so that the resulting embeddings are exactly the same every time, no matter how many times I run the algorithm in the same Python session?
opened by wendywangwwt 1
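
The first three sums reported above differ mainly in sign, which is consistent with the general fact that eigenvectors are only defined up to sign. A common post-processing step is to fix a sign convention per embedding column. The sketch below is a hypothetical helper working on plain lists of rows; it is not part of Karate Club, and it would not explain the last two values, which differ in magnitude as well.

```python
def fix_column_signs(embedding):
    """Flip each column so its largest-magnitude entry is positive.

    `embedding` is a list of rows; a new list of rows is returned.
    """
    n_cols = len(embedding[0])
    signs = []
    for col in range(n_cols):
        pivot = max((row[col] for row in embedding), key=abs)
        signs.append(-1.0 if pivot < 0 else 1.0)
    return [[value * sign for value, sign in zip(row, signs)] for row in embedding]


# Two toy "runs" that differ only by a sign flip in the first column.
run_a = [[0.5, -0.1], [-0.2, 0.7]]
run_b = [[-0.5, -0.1], [0.2, 0.7]]

aligned_a = fix_column_signs(run_a)
aligned_b = fix_column_signs(run_b)
```

After this step the two toy runs coincide, so downstream comparisons no longer depend on which sign the solver happened to return.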

Releases(v_10304)

v_10304(Dec 4, 2022)
What's Changed

Modify test statement to use pytest in lieu of setuptools. by @WhatTheFuzz in https://github.com/benedekrozemberczki/karateclub/pull/119

Update requirements to modern versions. by @WhatTheFuzz in https://github.com/benedekrozemberczki/karateclub/pull/120

New Contributors

@WhatTheFuzz made their first contribution in https://github.com/benedekrozemberczki/karateclub/pull/119

Full Changelog: https://github.com/benedekrozemberczki/karateclub/compare/v_10303...v_10304
v_10303(Oct 22, 2022)
What's Changed

Implemented first & second-order LINE by @LucaCappelletti94 in https://github.com/benedekrozemberczki/karateclub/pull/114

Full Changelog: https://github.com/benedekrozemberczki/karateclub/compare/v_10302...v_10303
v_10302(Sep 4, 2022)
What's Changed

Replaced fullargsspec with signature, as it broke in my system by @LucaCappelletti94 in https://github.com/benedekrozemberczki/karateclub/pull/111

Add get_params method to BaseEstimator by @tomlincr in https://github.com/benedekrozemberczki/karateclub/pull/112

New Contributors

@tomlincr made their first contribution in https://github.com/benedekrozemberczki/karateclub/pull/112

Full Changelog: https://github.com/benedekrozemberczki/karateclub/compare/v_10301...v_10302
v_10301(Aug 13, 2022)
What's Changed

docs: Fix a few typos by @timgates42 in https://github.com/benedekrozemberczki/karateclub/pull/104

Exposed parameter maximum_number_of_iterations by @LucaCappelletti94 in https://github.com/benedekrozemberczki/karateclub/pull/106

Pyre type error fixed. by @luca-digrazia in https://github.com/benedekrozemberczki/karateclub/pull/108

Resolved compatibility issue with sklearn in BoostNE by @LucaCappelletti94 in https://github.com/benedekrozemberczki/karateclub/pull/107

New Contributors

@luca-digrazia made their first contribution in https://github.com/benedekrozemberczki/karateclub/pull/108

Full Changelog: https://github.com/benedekrozemberczki/karateclub/compare/v_10300...v_10301
v_10300(Jun 4, 2022)

The release adds vector induction (inference) for all of the graph level embedding methods, including:

Graph2Vec
GL2Vec
v_10204(Jun 3, 2022)
What's Changed

NetworkX version fixed to <2.7 - scipy sparse version change.

Just fixed some warning of upcoming dropped features by @LucaCappelletti94 in https://github.com/benedekrozemberczki/karateclub/pull/93

New Contributors

@LucaCappelletti94 made their first contribution in https://github.com/benedekrozemberczki/karateclub/pull/93

Full Changelog: https://github.com/benedekrozemberczki/karateclub/compare/v_10203...v_10204
v_10203(Jan 22, 2022)

Full Changelog: https://github.com/benedekrozemberczki/karateclub/compare/v_10202...v_10203
v_10202(Sep 29, 2021)

Added Wavelet Characteristic from the CIKM 2021 paper: Graph Embedding via Diffusion-Wavelets-Based Node Feature Distribution Characterization
v_10201(Aug 4, 2021)
Weighted FEATHER algorithm.

v_10200(Jul 2, 2021)
The new release supports directed and disjoint graphs:

Directed graph support.

Disjoint graph support.

v_10100(May 19, 2021)
Allows higher version of gensim.

v_10024(Mar 30, 2021)
Added flag.

v_10023(Jan 25, 2021)

Release SocioDim.
v_10022(Dec 1, 2020)

Increased vector count.
v_10020(Nov 20, 2020)
Added RandomNE

v_100021(Nov 20, 2020)
Added LDP

v_10019(Nov 6, 2020)

Fixing the M-NMF.
v_100018(Nov 5, 2020)
Noise parameter added.

v_10017(Oct 23, 2020)

Added the AE dataset
v_10016(Oct 18, 2020)

Added GLEE
v_10015(Sep 25, 2020)

Added ASNE.
V_10014(Aug 23, 2020)
The Invariant Graph Embedding paper was added from ICML 2019.

v_100013(Aug 10, 2020)

Resolved version consistency.
v_100011(Jul 23, 2020)
Added feature erase.

Added WL fix change.

V_10010(Jul 22, 2020)

Correct import.
v_10009(Jul 19, 2020)
Ensured the type safety.

v_10008(Jul 5, 2020)

Error handling by assertions.
v_10007(Jun 10, 2020)
Travis CI.

100% Test coverage reached.

Codecov integration.

v_10006(Jun 3, 2020)
General fix of python random, Scipy, Numpy and Gensim seeding.

v_10005(May 28, 2020)
Fixed error handling.


Owner

Benedek Rozemberczki

PhD candidate at The University of Edinburgh @cdt-data-science working on machine learning and data mining related to graph structured data.

GitHub https://karateclub.readthedocs.io

