GREASE : graph imbalance reduction by adding sets of edges
- Author
- Yoosof Mashayekhi (UGent) , Bo Kang (UGent) , Jefrey Lijffijt (UGent) and Tijl De Bie (UGent)
- Organization
- Project
-
- Formalizing Subjective Interestingness in Exploratory Data Mining
- Fair, Effective, and Sustainable Talent Management using Conditional Network Embedding
- Onderzoeksprogramma Artificiële Intelligentie - 2023
- Data Mining without Spilling the Beans: Preserving more than Privacy Alone
- Exploring Data: Theoretical Foundations and Applications to Web, Multimedia, and Omics Data
- Conditional Knowledge Graph Embedding
- Flanders Artificial Intelligence Research program (FAIR) – second cycle - 2024
- Abstract
- Real-world data can often be represented as a heterogeneous network relating nodes of different types. E.g., a job market can be represented as a job seeker-skill-vacancy network. It can be relevant to consider the imbalance between nodes of different types, in terms of whether they are similarly connected in the network. For example, it is desirable that job seekers and vacancies are mixed well. If they are not, then there is imbalance. We propose to quantify the imbalance between two sets of nodes in a network as the Earth Mover's Distance between the sets. Given this quantification, we introduce GREASE (Graph imbalance REduction by Adding Sets of Edges), a method that selects a fixed number of unconnected node-pairs, which—if links were added between them—aims to maximally reduce the imbalance. In the job market network, GREASE can be used to select skills that job seekers do not yet have, but could strive to acquire, to reduce the imbalance between job seekers and vacancies. GREASE may also be used in other applications, such as reducing controversy between opposing sides on a polarizing topic. We evaluated GREASE on several datasets and find that GREASE outperforms baselines in reducing network imbalance.
- Keywords
- Computational Theory and Mathematics, Computer Science Applications, Information Systems, representation learning, network embedding, imbalance reduction, graph algorithms, Graphs and networks, Programming, Task analysis, Java, Python, Training, Earth, Costs
Downloads
-
grease ieee.pdf
- full text (Accepted manuscript)
- |
- open access
- |
- |
- 1.22 MB
-
(...).pdf
- full text
- |
- UGent only
- |
- |
- 1.46 MB
Citation
Please use this url to cite or link to this publication: http://hdl.handle.net/1854/LU-01HDDS8JX17J7S4DVYXA2WHAMA
- MLA
- Mashayekhi, Yoosof, et al. “GREASE : Graph Imbalance Reduction by Adding Sets of Edges.” IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, vol. 36, no. 4, 2024, pp. 1611–23, doi:10.1109/tkde.2023.3304478.
- APA
- Mashayekhi, Y., Kang, B., Lijffijt, J., & De Bie, T. (2024). GREASE : graph imbalance reduction by adding sets of edges. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 36(4), 1611–1623. https://doi.org/10.1109/tkde.2023.3304478
- Chicago author-date
- Mashayekhi, Yoosof, Bo Kang, Jefrey Lijffijt, and Tijl De Bie. 2024. “GREASE : Graph Imbalance Reduction by Adding Sets of Edges.” IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 36 (4): 1611–23. https://doi.org/10.1109/tkde.2023.3304478.
- Chicago author-date (all authors)
- Mashayekhi, Yoosof, Bo Kang, Jefrey Lijffijt, and Tijl De Bie. 2024. “GREASE : Graph Imbalance Reduction by Adding Sets of Edges.” IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 36 (4): 1611–1623. doi:10.1109/tkde.2023.3304478.
- Vancouver
- 1.Mashayekhi Y, Kang B, Lijffijt J, De Bie T. GREASE : graph imbalance reduction by adding sets of edges. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING. 2024;36(4):1611–23.
- IEEE
- [1]Y. Mashayekhi, B. Kang, J. Lijffijt, and T. De Bie, “GREASE : graph imbalance reduction by adding sets of edges,” IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, vol. 36, no. 4, pp. 1611–1623, 2024.
@article{01HDDS8JX17J7S4DVYXA2WHAMA, abstract = {{Real-world data can often be represented as a heterogeneous network relating nodes of different types. E.g., a job market can be represented as a job seeker-skill-vacancy network. It can be relevant to consider the imbalance between nodes of different types, in terms of whether they are similarly connected in the network. For example, it is desirable that job seekers and vacancies are mixed well. If they are not, then there is imbalance. We propose to quantify the imbalance between two sets of nodes in a network as the Earth Mover's Distance between the sets. Given this quantification, we introduce GREASE (Graph imbalance REduction by Adding Sets of Edges), a method that selects a fixed number of unconnected node-pairs, which—if links were added between them—aims to maximally reduce the imbalance. In the job market network, GREASE can be used to select skills that job seekers do not yet have, but could strive to acquire, to reduce the imbalance between job seekers and vacancies. GREASE may also be used in other applications, such as reducing controversy between opposing sides on a polarizing topic. We evaluated GREASE on several datasets and find that GREASE outperforms baselines in reducing network imbalance.}}, author = {{Mashayekhi, Yoosof and Kang, Bo and Lijffijt, Jefrey and De Bie, Tijl}}, issn = {{1041-4347}}, journal = {{IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING}}, keywords = {{Computational Theory and Mathematics,Computer Science Applications,Information Systems,representation learning,network embedding,imbalance reduction,graph algorithms,Graphs and networks,Programming,Task analysis,Java,Python,Training,Earth,Costs}}, language = {{eng}}, number = {{4}}, pages = {{1611--1623}}, title = {{GREASE : graph imbalance reduction by adding sets of edges}}, url = {{http://doi.org/10.1109/tkde.2023.3304478}}, volume = {{36}}, year = {{2024}}, }
- Altmetric
- View in Altmetric
- Web of Science
- Times cited: