DMBDA 2025 Abstracts


Area 1 - DMBDA

Full Papers
Paper Nr: 5
Title:

A Nested Structure of Anomalies in Academic Publication Citations

Authors:

Renata Avros, Gal Farfel and Zeev Volkovich

Abstract: The article presents a novel approach to detecting nested anomalies in citation networks. These anomalies, as irregularities within citation patterns, significantly threaten the reliability of academic research. Traditional methods for anomaly detection often study the entire citation graph, missing abnormalities within specific subfields or research clusters. Unlike these methods, our approach delves deeper by examining articles within the citation network at different nested scales. Such an approach allows anomalies that might be missed to be uncovered by focusing on a single level, revealing hidden patterns across various granularities, detecting a broader spectrum of nested irregularities, and offering a more nuanced understanding of how citation patterns deviate from the expected. The presented approach supports identifying potential issues, such as citation manipulation, and uncovering emerging trends within the network. The delivered numerical experiments also demonstrate the method's ability to estimate the consistency of the dataset structure.

Paper Nr: 7
Title:

Citation Steadiness Analysis with GraphSAGE Approach

Authors:

Renata Avros, Dvora Toledano Kitai and Zeev Volkovich

Abstract: Citation manipulation occurs when references are deliberately included in academic works for reasons unrelated to their genuine scholarly merit. Instead of serving their primary purposes—such as supporting arguments, providing context, or guiding readers—these citations are often utilized to inflate metrics like citation counts artificially. Manipulated citations tend to deviate from the standard patterns and structures found in authentic citation networks. Consequently, when such networks are perturbed by removing certain nodes or connections, these manipulated citations are more likely to exhibit inconsistencies. This paper introduces a method for detecting citation manipulation by studying how citation patterns change under random perturbations of the citation graph. The method employs the GraphSAGE algorithm to generate embeddings of the altered graph in an Euclidean space, thereby reconstructing the removed edges. The approach assumes that legitimate citations are bolstered by a network of indirect connections, leading to closely related embeddings for nodes linked by authentic citations that facilitate the accurate prediction of missing edges. By iteratively perturbing the graph and assessing the accuracy of edge reconstruction, the method highlights suspected manipulated citations, which consistently exhibit poor reconstruction performance, signifying supposed anomalous comportment. Numerical experiments validate the effectiveness of this approach in identifying anomalies within citation networks, highlighting its potential as a reliable tool for enhancing the integrity of scholarly communication.

Paper Nr: 8
Title:

Appraisal of Citation Reliability Using a Gan-Based Approach

Authors:

Dvora Toledano Kitai, Renata Avros, Ilya Lev, Biran Fridman and Zeev Volkovich

Abstract: This paper addresses the pressing issue of citation manipulation in academic publications. Traditional detection methods, which rely on expert manual review, struggle to keep pace with the ever-growing volume of research output. To overcome these limitations, this study introduces an automated, network-based approach for identifying unreliable citations using an Encoder-Decoder model. By learning regular citation patterns, the model detects anomalies through reconstruction errors. Citation reliability is assessed by systematically removing edges from a citation network and predicting their reinstatement using a modified GAN-based framework. Successful predictions indicate legitimate citations, while failures suggest potential manipulation. The proposed methodology is validated on the CORA dataset, demonstrating its effectiveness in distinguishing genuine references from manipulated ones. This approach provides a scalable and data-driven solution for enhancing research integrity and mitigating citation distortions in scholarly literature.