Departamento de Inteligencia Artificial, Universidad Politécnica de Madrid. 28660 Madrid, Spain
Predicting citation count of Bioinformatics papers
within four years of publication
RESULTS


     We will build two different types of prediction models: Global models and Specific models. Global models attempt to predict the number of citations received by an article within each of one the next four years after publication, using information of all papers published in Bioinformatics journal during three years, between January 1, 2005 and December 31, 2007. Specific models have the same objective as global models, but in this case, they use the information related to articles published within a specific Bioinformatics journal section.

     Paradigms used in this work are: Bayesian networks (naiveBayes and K2), logistic regression, decision trees (C4.5) and k-nearest neighbors (5-NN).


General models

        Percentage of well classified cases within the next four years after publication and confusion matrices.   (Table)


Specific models

        Average accuracy within the four prediction years by each section and paradigm. (Table, Figure).
        Average accuracy within the nine publication journal sections by each prediction year and paradigm. (Table, Figure).

        Structure models:

            Bayesian networks (naiveBayes):            Section 1 - First-year                Section 2 - First-year
           
            Bayesian networks (K2):                        Section 2 - Fourth-year             Section 4 - First-year

            Decision trees (C4.5)                             Section 4 - Second-year            Section 8 - Fourth-year

      
        Accuracy and standard deviation. Numbers in boldface represent an average success rate better than 95%.


             



Section 1
Section 2
Section 3
Section 4
Section 5
Section 6
Section 7
Section 8
Section 9


First year
NB

K2

LR

C4.5

K-NN

------------
---------

--------------------
---------------------
----------------------
----------------------
----------------------
----------------------
----------------------
----------------------
----------------------


Second year
NB

K2

LR

C4.5

K-NN

------------
---------

--------------------
---------------------
----------------------
----------------------
----------------------
----------------------
----------------------
----------------------
----------------------


Third year
NB

K2

LR

C4.5

K-NN

------------
---------

--------------------
---------------------
----------------------
----------------------
----------------------
----------------------
----------------------
----------------------
----------------------


Fourth
year
NB

K2

LR

C4.5

K-NN

HOME            DATA            RESULTS            EXPLOITING BEST MODELS