Have a personal or library account? Click to login
A novel approach based on journal coupling to determine authors who are most likely to be part of the same invisible college Cover

A novel approach based on journal coupling to determine authors who are most likely to be part of the same invisible college

Open Access
|Dec 2024

Full Article

1
Introduction

Borgman and Furner (2002) highlighted the importance of choosing the publication venue for scholars, “matching the topic of the paper with the scope statement of the journal and the topicality of previous articles published.” However, authors and their publications determine the creation of intellectual knowledge and social identity of those same journals where they publish their work. This is because by publishing in a certain journal, scholars become active participants in the group of authors associated with the journal. It is then that their academic identity is added to that of the other authors who have also published there, contributing to the thematic configuration of that publication venue. In the literature there are several definitions of the academic identity of an author. For example, White (2001) defined an author’s citation identity as the set of authors that an author cites. Assuming an author has been cited at all, White and McCain (1998) also defined an author’s citation image as the set of all authors with whom an author has been co-cited.

In this scenario, the grouping of authors according to the shared academic journals can provide evidence for the existence of social and intellectual communities of scholars. Price and Beaver (1966) introduced the concept of “invisible college”, highlighting the importance of the social grouping of scholars. The basic phenomenon behind invisible colleges is that in the most active and competitive research fields there seems to exist an “in-group” of academics. Authors in this subgroup contribute materially to research on the same topic and often publish their work in similar publication venues. As suggested in (Price & Beaver, 1966), the problem is that although it is relatively easy to find a well-known scholar in a chosen field of research, it is considerably more difficult to select a group of authors who make up the majority of a single invisible college. The basic difficulty is capturing and dissecting that group of scholars within a specific field of research or discipline (Price & Beaver, 1966).

In this paper, following these ideas, we propose a clustering of authors according to their journal publication profile. Using this approach, the similarity between authors is computed based on the shared academic journals. Therefore, the basic assumption will be that authors who published in the same journals are more likely to share similar conceptual frameworks and interests (and therefore, they are more likely to be part of the same invisible college) than those who never publish in the same journals (Garcia et al., 2012; Ni et al., 2013; Price & Beaver, 1966). Thus, in our study, the mention of the name of a journal evokes all the articles that have been published by that journal and consequently represents a “conceptual marker,” (Ni et al., 2013). Our model is based on this idea: if the journal in which scholars published their work represents a conceptual marker, it follows that authors can be identified by the journals in which they publish their research (Garcia et al., 2012; Ni et al., 2013). In this way authors belonging to a research area can differentiate themselves by identifying an underlying group of journals. The authors will thus be grouped both socially and intellectually, the latter being the norm in citation studies.

In disciplines with high degrees of single authorship, hyper-authorship behavior does not determine a good descriptor of the interaction structure between the authors of the discipline. Therefore, in this scenario, collaborative networks based on hyper-authorship behavior are not appropriate. Alternatively, our approach identifies similarities between authors based on similarities in their journal publication profiles, and consequently, it does not matter whether or not authors interact with their community through co-authorship. Therefore, we follow (Minguillo, 2010)’s fundamental premises that “journals act as platforms of interaction and membership for scientific fields.” He stated that “analysis of the relation between authors and journals makes it possible to see how communication among scientists overall forms the structure of highly specialized and well-controlled scientific (sub)fields influenced by the reputational system and cognitive limitations.”

Using this premise, our research will investigate the degree to which authors (who published in information science and library science journals) are related to each other, based on their journal publication profile (i.e., their research output). Therefore, the grouping of authors using journal coupling (i.e., the shared academic journals) provides a new approach to examine the social and intellectual coherence of the considered set of authors.

In this study, after presenting our approach, we find the clusters of authors (or invisible colleges) in the information science and library science (IS&LS) category of the Web of Science Core Collection. Therefore, our results may be of interest to IS&LS scholars. This is because these results provide a new lens for grouping authors, making use of the authors’ journal publication profile and journal coupling. Furthermore, extending our approach to the study of the structure of other disciplines would possibly be of interest to historians of science as well as scientometricians.

2
Related works

To demonstrate the structure of scientific domains, several methods based on co-occurrence and coupling have been proposed. On the one hand, co-citation, co-wording, and co-authorship methods, are based on the co-occurrence of elements (Qiu et at., 2014; Small, 1973; Tijssen & Van Raan, 1994; White & McCain, 1998). For example, co-citation establishes a co-occurrence relationship between cited references. On the other hand, Kessler (1963) proposed the concept that “two documents are related if they share the same set of citations.” Bibliographic coupling is based precisely on this concept and, thus, researchers want to observe how many common references two articles share. Later Small and Koenig (1977) also proposed using bibliographic coupling of journals. In our study, the similarity of authors is based on this type of coupling assumption. We then claim that two authors are related if they share similar publication venues. This is journal coupling of authors. We used journals to discover relationships among authors. In journal coupling of authors, the number of shared publication venues among authors can be used to measure their relationships.

In relation to research works that connect authors through journals, Garcia et al. (2012) presented a novel methodology to map academic institutions based on their journal publication profiles. Using a sample of Spanish universities as a case study, Garcia et al., (2012) mapped the study sample according to the overall research output of each institution and in different disciplinary contexts. Furthermore, Robinson-Garcia et al., (2013) presented a descriptive analysis of Spanish universities according to their journal publication profile in five scientific domains during the period 2007-2011. In their study, two universities had a similar journal publication profile if they published in a high number of common journals. This idea led them to the possibility of mapping universities and thus offering an enriched view of the Spanish higher education system. Extending the idea of bibliographic coupling, Ni et al., (2013) proposed a metric called author-publication-place coupling that identifies similarities between journals based on the similarities of their author profiles. The use of this method in information science and library science journals provided evidence of four distinct subfields, namely, management information systems, specialized information and library science, library science-focused, and information science-focused research.

Therefore, our study introduces a novel approach for clustering authors on the basis of the common publication in academic journals. A map of authors can then be used to visualize the relationships between these authors using journal coupling. This tool will make it possible for individual scholars to see how they relate to other authors. Also, it may be relevant for studies exploring the scholars in a discipline or area, or, more in general, within the research management context (Noyons, 2004).

When visualizing the structure of science, the authors’ maps are commonly visualized as node-edge diagrams, similar to those used in network science. On one hand, the representation of the authors is carried out by placing each of them in a two-dimensional space (Klavans & Boyack, 2009). On the other hand, relationships between authors are represented by the explicit linking of pairs of authors (Klavans & Boyack, 2009). Henry Small and colleagues (Griffith et al., 1974; Small & Koenig, 1977; Small & Garfield, 1985) were the first to use bibliometric methodologies to map all of science. Their approach was simple: focus their attention on highly co-cited articles. In their developments, they defined these highly co-cited articles as pairs of references that coexist in bibliographies at least five times in a year. Using a different level of granularity than articles when visualizing the structure of science, specifically publication venues, Narin et al., (1972) chose academic journals as the basic unit for mapping science. To do this, Narin et al. (1972) first divided the journals into a certain number of groups and then used a visualization algorithm to generate a layout of the previously obtained groups. More recently, different techniques have been used to create discipline-based maps. For this, the large amounts of data that are currently available on scientific publications have been used. In addition to new visualization tools. Using this approach, Moya-Anegón et al., (2004, 2007) created several discipline-based maps. They used the Thomson Scientific disciplinary classification system. Leydesdorff and Rafols (2009) represents another relevant example of discipline-based maps.

In information science (IS) and library science (LS), different bibliometric analyzes have been carried out to study the structure of the research area. For example, Saracevic, (1999) seeing “the two as separate but closely related fields.” Analyzing the structure of library and information science (LIS), Moya-Anegón et al., (2006) found LS, together with information management, “being included or excluded depending on what level the co-citation analyzes” were done. However, according to a set of citation analyzes performed in (Astrom, 2010), “the patterns in the citation data support the concept of a joint LIS field with information science and library science being the two main subfields.” In our study, we determine the structure of information science and library science using journal coupling.

Several previous studies identified the most influential journals in the LIS field. For example, Vinkler, (2019) identified the core journals in scientometrics using the frequency of articles in academic journals in the elite publication subgroups of Price medalists. Nixon (2014), Walters and Wilder (2015), and Weerasinghe (2017) also found the most important journals for determining the structure of the research area using different approaches, mainly reflecting reputational factors based on bibliometric indicators or expert surveys. In a more recent paper, Safón and Docampo (2023) found trendy journals which are the most read by scholars who are publishing within the scope of a consolidated discipline. In the following sections, we also analyze the most relevant journals to determine the structure of information science and library science when using journal coupling. Our results are consistent with those of previous studies that identify which journals are truly influential within the discipline.

3
Data and methods

In this article we present an automatic system for grouping academics from a research area. The first choice in our model is related to the research production units that we will use as author descriptors. Here, given the availability of large amounts of data on scientific publications, we have used academic journals to represent the scholars’ research output. With this choice, instead of using more complex descriptors, we will avoid the problems shown in (Salton & Buckley, 1988). However, this will force us to use as descriptors the journals in which the author has published, which may not determine a complete identification of an author’s scientific results. This balance between precision and complexity will allow us to develop an operational representation of academics’ research output.

Taking all this into account, and with the aim of improving the use of academic journals as descriptors, it will be mandatory to differentiate the most important journals from those that are less important when grouping authors based on their shared academic journals, using journal coupling. Therefore, in our modeling we are going to introduce journal weights that allow us to make distinctions between those venues in which an academic has published, based on their value as a descriptor when using journal coupling. This will be shown in the following sections, where we present the generation of effective weighting factors attached to journals that act as author descriptors.

Therefore, in our problem, for each author in a discipline or research area, we record the academic journals in which these authors published their articles. Next, using the list of academic journals in which scholars have published their research results, we construct a journal-by-author matrix. In this matrix, each row contains the weights of the individual journals for each of the authors considered (see Section 3.1). These weights must take into account that journals in which a large proportion of academics have published are a poor indicator of the similarity between two specific authors. That is why in our model, as shown in Section 3.1, we will calculate the weights associated with journal descriptors using the inverse frequency method (Salton & Buckley, 1988).

Based on this journal-by-author matrix, we next measure the similarity between two authors using the approach for calculating the similarity between two documents proposed by (Ahlgren & Colliander, 2009). However, the similarity between authors can be obtained using two different approximations as briefly described below (see Section 3.2 for more details). Specifically, first-order similarities can be obtained by measuring the similarity between columns in a journal-by-author matrix. In this first-order approach, one focuses on the direct similarity between two authors. However, we can also obtain similarities by measuring the similarity between columns in this first-order author-by-author similarity matrix (see Section 3.2). This approach produces a new author-by-author similarity matrix, populated with second-order similarities. This second-order approach finds that two authors are similar by detecting that there are other scholars such that both authors are similar to each of them. In a scientometric context,Ahlgren and Colliander (2012), and Janssens (2007) found that the second-order approach works better than the first-order.

There is only one step left to execute if we want to obtain the clusters of authors that arise from the second-order similarities. Clustering analysis is then used to group the authors based on the similarity values (see Section 3.3). More precisely, we used hierarchical clustering and the complete linkage method to cluster analysis (Everitt et al., 2001). In this article, we illustrate the performance of our approach using a set of 302 authors who have published articles in the IS&LS category of the Web of Science Core Collection (see Section 4). For this group of scholars, we found four main invisible colleges that are made up of authors who frequent the same journals, and therefore, share similar conceptual frameworks.

All the software used in this analysis and the raw data collected are available at https://github.com/rosadecsai/grouping_authors.git.

3.1
Journal vector for author representation

We assume a given set of authors A = {ai} that we want to study to find invisible colleges in a research area. In our study, the basic assumption is that relationships between the research output of different authors in A can be found by comparing the academic journals in which these authors published their work. Therefore, two authors in A are related if they share similar publication venues (journal publication profiles). Our grouping of authors is then based on journal coupling.

In our model, J = {jm} represents the list of journals where authors in A published their work. We then constructed a journal-by-author matrix W = {wm,i}, with wm,i being the weight for representing the ai’s research output using as descriptor the journal jm. Therefore, the author ai can be represented using a vector of descriptors and journal weights as follows (see (Salton & Buckley, 1988) for further details): 1Jai=j1,w1,i;j2,w2,i;;jM,wM,i{J_{{a_i}}} = \left( {{j_1},{w_{1,i}};{j_2},{w_{2,i}}; \ldots ;{j_M},{w_{M,i}}} \right) with M being the size of J = {jm}.

In our model, each journal receives a weight that represents its level of relevance as a descriptor of the author. The journal weights wm,i can vary their value between 0 and a maximum value. This determines a greater degree of discrimination between the journals of J that intervene as descriptors. Thus, the most relevant journals as descriptors of an author receive a greater weight. On the contrary, academic journals that are less relevant as author descriptors will receive lower weight assignments, close to 0, or even a value of 0 if the author has not published any work in the journal. In this approach, therefore, there is no threshold for considering a journal to be relevant.

An author may have published several articles in different academic journals. However, when we compare this author with the rest in A, not all of these journals will be equally relevant when acting as descriptors of the author’s scientific production. In that sense, the best descriptors will be those journals that are truly capable of distinguishing certain authors from the rest in A. This implies that the best journals jm for representing the research output of author ai should have high journal frequencies (i.e., journals in which ai frequently published their work) but low overall frequencies across authors in A.

In our model, freqm,i represents the number of papers that author ai published in journal jm. Since a journal in which a large proportion of scholars published their work often is a bad indicator of similarity between authors, it is reasonable to weight a journal jm in accordance with how frequently different authors in A published their work in this journal. In our model, we used the inverse frequency factor to perform this function (Salton & Buckley, 1998): 2logNnm\log \left( {{N \over {{n_m}}}} \right) with N being the size of A = {ai}; and nm being the number of scholars who published in journal jm.

Therefore, in our model, the value of journal jm as descriptor of author ai (wm,i) is the product of the journal frequency and the inverse frequency factor (Ahlgren & Colliander, 2009; Salton & Buckley, 1988): 3wm,i= freq m,i×logNnm{w_{m,i}} = {\rm{ freq}}{{\rm{ }}_{m,i}} \times \log \left( {{N \over {{n_m}}}} \right) where freqm,i is the number of articles that author ai published in journal jm; and the inverse frequency factor logNnm\log \left( {{N \over {{n_m}}}} \right) varies inversely with the number of scholars who published in jm.

3.2
Author-author similarities based on journal coupling

Now, the similarity between authors in A is determined based on the number of shared journals between these authors. That is, the similarity value is measured using journal coupling. Recall that, following (Garcia et al., 2012; Ni et al., 2013), the underlying assumption of our approach is that authors who publish in the same journals are more likely to share similar conceptual frameworks, and thus to be part of the same invisible college, than those who never publish in the same venues.

From equation (1), the similarity between authors ai and aj in A could be calculated using the vector product formula. However, this formula does not take into account that the weights of the journals must depend to some extent on the values of the weights assigned to the other journals in the same vector. In our model, we solve this problem by using a length normalized journal-weighting system. However, introducing this normalization, as demonstrated in (Baeza-Yates & Ribeiro-Neto, 1999), the value of the similarity between two authors turns out to be the cosine of the angle between the two journal vectors that represent authors ai and aj: 4Bai,aj=mwm,i×wm,jmwm,i2mwm,j2B\left( {{a_i},{a_j}} \right) = {{\sum\limits_m {{w_{m,i}}} \times {w_{m,j}}} \over {\sqrt {\sum\limits_m {{{\left( {{w_{m,i}}} \right)}^2}} } \sqrt {\sum\limits_m {{{\left( {{w_{m,j}}} \right)}^2}} } }} with wm,i (wm,j) being the jm ‘s weight for representing author ai (aj); and sums are over all journals in the set J.

This first-order approach measures the similarity between two authors directly using their journal vector representation. However, alternatively, a second-order strategy finds that two authors are similar by detecting that there are other scholars in A such that both authors are similar to each of them. Thus, using equation (4), a second-order similarity matrix is defined as follows (see (Ahlgren & Colliander, 2009) for more details): 5Sai,aj=kBak,ai×Bak,ajkBak,ai2kBak,aj2S\left( {{a_i},{a_j}} \right) = {{\sum\limits_k B \left( {{a_k},{a_i}} \right) \times B\left( {{a_k},{a_j}} \right)} \over {\sqrt {\sum\limits_k {{{\left( {B\left( {{a_k},{a_i}} \right)} \right)}^2}} } \sqrt {\sum\limits_k {{{\left( {B\left( {{a_k},{a_j}} \right)} \right)}^2}} } }} where sums are over all authors in the set A. Therefore, this second-order approach measures the similarity between two authors using the first-order measure of similarity between scholars in A.

3.3
Clustering analysis

The last stage of our approach (before interpreting the results obtained) will be to group the authors in A using agglomerative hierarchical clustering. In this effort, we first obtain dissimilarity values between authors in A. For its calculation we use the second-order similarity values obtained in the equation (5). The dissimilarity value is obtained by subtracting the given similarity value from 1.

In our clustering analysis, we will initially have as many clusters as there are individual authors in A. That is, each author determines a single cluster. However, subsequently and at each stage of the agglomerative grouping, the closest clusters according to a measure of distance between them, join together forming a new, larger group of authors. Agglomerative hierarchical clustering stops when a single author group remains, which will consist of all authors of A.

In our analysis, the distance measure between clusters is obtained using the complete linkage method proposed in (Everitt et al., 2001). Therefore, the distance between two clusters is obtained by calculating the maximum distance between pairs of authors, the first of them belonging to one cluster and the second author to the other cluster. To do this, at each stage of the agglomerative clustering, the distance between two groups of authors is calculated as the maximum second-order dissimilarity between two authors, one from the first group and one from the second.

In our study, we used the SciPy package (Virtanen et al., 2020) to perform the complete linkage clustering based on second-order dissimilarities between authors in A. The components at each iterative step of the agglomerative clustering are always a subset of authors or groups of authors. Hence, a tree diagram, or dendrogram will be used to represent the grouping of authors. At a given level of the clustering, the clusters that exist above and below a grouping threshold are obtained simply using horizontal slices of the tree. Thus, in the next section, dendrograms illustrate the agglomerative hierarchical clustering of authors based on journal coupling. However, what will be the cutoff threshold defined in the agglomerative hierarchical clustering? In our experiments we used the maxclust criterion in fcluster of the SciPy package (Virtanen et al., 2020): fcluster (Z, nu mclust, criterion=‘maxclust’). Using the maxclust criterion, we find an optimal distance between each pair of points which are going to be in the same cluster. All the software used in this analysis is available at https://github.com/rosadecsai/grouping_authors.git.

4
Case of study: Map of IS&LS authors based on journal coupling

In this section, we illustrate the grouping of authors using journal coupling. The results shown in this paper are from a controlled exercise. We analyzed 302 authors who had published in the IS&LS category of the Web of Science Core Collection. These were authors who published an article between 2022 and 2024 or who published a highly cited article between 2014 and 2024.

We downloaded the complete list of academic journals in which they had published all their works. For each author, we retrieved all the scientific journals in which this author had published his/her articles. We then used the cosine measure to calculate the similarity between authors (both first and second order).

Figure 1 illustrates the dendrogram of IS&LS authors according to their journal publication profile (using journal coupling as described above). To obtain this dendrogram of IS&LS authors, we used the complete linkage method for clustering the 304 IS&LS authors, using second-order dissimilarities. Based on the maxclust criterion of the SciPy package (Virtanen et al., 2020), we found four distinct clusters according to similarities in their research output. Authors who frequent the same journals are more likely to be part of the same invisible college (Price & Beaver, 1966). This is because by publishing in the same journals, they share similar conceptual frameworks. Thus, from the four distinct clusters in Figure 1 using the maxclust criterion, we found four invisible colleges in the list of authors. Figure 2 illustrates a visualization in VOSviewer of these four invisible colleges. This map is a node-edge diagram, which places each author on the plane and links pairs of scholars using their corresponding author-author similarities.

Figure 1.

Dendrogram of IS&LS authors according to their journal publication profile.

Figure 2.

Visualization in VOSviewer of a map for IS&LS authors using journal coupling.

The four invisible colleges using journal coupling of authors were: (i) “Information Systems” invisible college (see Figure 3); (ii) “Business and Information Management” invisible college (see Figure 4); (iii) “Quantitative Information Science” invisible college (see Figure 5); and (iv) “Library Science” invisible college (see Figure 7). From the dendrogram of the agglomerative hierarchical clustering (see Figure 1), we see that the invisible colleges of “Quantitative Information Science” and “Information Systems” determine two very strong groupings. However, the invisible colleges of “Library Science” and “Business and Information Management” are relatively less strong clusters and show a more diversified journal publication profile (see Figures 3, 4, 5, and 7). Tables 14 illustrate the 100 journals of higher weight to determine each invisible college based on journal coupling.

Figure 3.

Visualization in VOSviewer of the ‘Information Systems’ invisible college.

Figure 4.

Visualization in VOSviewer of the “Business and Information Management” invisible college.

Figure 5.

Visualization in VOSviewer of the “Quantitative Information Science” invisible college.

Table 1.
The 100 journals of higher weight to determine the invisible college of “Information Systems”
Information Systems
RankJournal AbbreviationWeightRankJournal AbbreviationWeight
1INFORM SYST RES153.8851J GLOB INF MANAG6.33
2INFORM SYST J122.6952J ORGAN END USER COM6.24
3J MANAGE INFORM SYST117.1453ELECTRON COMMER RES6.24
4J ASSOC INF SYST110.9054J BUS ETHICS6.24
5EUR J INFORM SYST104.6755EUR J OPER RES6.24
6MIS QUART75.6656MANAGE SCI6.04
7P ANN HICSS71.3957INT FED INFO PROC5.75
8INFORM MANAGE-AMSTER52.3658J ELECTRON COMMER RE5.75
9BUS INFORM SYST ENG+49.9159UROLOGY5.55
10J STRATEGIC INF SYST38.8260PERS INDIV DIFFER5.55
11J INF TECHNOL-UK38.1261BEHAV BRAIN RES5.55
12DECISION SCI36.0462J ENDOUROL5.55
13INT J INFORM MANAGE30.7863DES ISSUES5.55
14DECIS SUPPORT SYST29.0664CHINA ECON REV5.55
15INFORM MANAGE26.3465TELECOMMUN POLICY5.55
16J UROLOGY26.3466LECT NOTES BUS INF P5.47
17DATA BASE ADV INF SY25.6567J ORG COMP ELECT COM4.89
18PROD OPER MANAG24.9568INT J HUM-COMPUT ST4.89
19ELECTRON MARK22.1869IFIP ADV INF COMM TE4.85
20OMEGA-INT J MANAGE S19.4170J INFORM TECHNOL4.16
21MIS Q EXEC19.4171PERS PSYCHOL4.16
22COMMUN ACM19.2772RES TECHNOL MANAGE4.16
23ORGAN SCI18.0273J TRADIT CHIN MED4.16
24MIT SLOAN MANAGE REV15.2574ECON MODEL4.16
25ELECTR J INF SYS DEV14.5675J MARKETING RES4.16
26INT J ELECTRON COMM13.8676J ASIAN NAT PROD RES4.16
27IEEE T ENG MANAGE13.5277WORLD ECON4.16
28ADV MANAG INFORM SYS13.1778INT J PROD RES4.16
29J BUS RES13.1779IEEE T SYST MAN CY C4.16
30GROUP DECIS NEGOT13.1780CHIN MED-UK4.16
31IND MANAGE DATA SYST12.4881J INFECT DIS4.16
32IT PROF12.4882J DECIS SYST4.16
33L N INF SYST ORGAN12.4883CHANDOS ASIAN STUD4.16
34COMPUT SECUR12.4884INT J ACCOUNT INF MA4.16
35PAC ASIA J ASSOC INF11.7885J POLYNESIAN SOC4.16
36WIRTSCHAFTSINF11.0986J ORGAN EFF-PEOPLE P4.16
37ELECTRON COMMER R A10.0787ACM TRANS MANAG INF4.16
38SMALL GR RES9.7088FEBS LETT4.16
39INFORM SYST MANAGE9.7089BIOCHEM INT4.16
40ORGAN BEHAV HUM DEC9.7090J SMALL BUS MANAGE3.47
41DATA BASE9.7091J OPER RES SOC3.47
42J INF TECHNOL9.0192ACAD MANAGE J3.47
43J OPER MANAG9.0193INF SYST E-BUS MANAG3.47
44INFORM ORGAN-UK9.0194J AM MED INFORM ASSN3.47
45J APPL PSYCHOL8.3295INFORM TECHNOL DEV3.47
46IEEE T PROF COMMUN7.6296ANN OPER RES3.47
47COMPUT HUM BEHAV7.1997INFORM SOC3.47
48J MARKETING6.9398FRONT PSYCHOL2.88
49J ACAD MARKET SCI6.9399J GLOB INF TECH MAN2.88
50PEDIATR INFECT DIS J6.93100BIOMED PHARMACOTHER2.77
Table 2.
The 100 journals of higher weight to determine the invisible college of “Business and Information Management”.
Business and Information Management
RankJournal AbbreviationWeightRankJournal AbbreviationWeight
1J BUS RES172.5951REV MANAG SCI19.41
2J KNOWL MANAG135.8652BENCHMARKING19.41
3INT J INFORM MANAGE92.9253SPRINGERBRIEF BUS19.41
4PROD PLAN CONTROL85.9554J CONSUM BEHAV19.41
5J HOSP MARKET MANAG70.7055INT J PROD ECON18.99
6INT J CONTEMP HOSP M70.0156J TECHNOL TRANSFER18.71
7J TRAVEL RES69.3157IEEE T ENG MANAGE18.41
8ANN OPER RES67.2458INNOV PUBLIC SECT18.02
9TECHNOL FORECAST SOC66.7459SERV IND J17.33
10IND MANAGE DATA SYST64.4660INT J E-BUS RES16.64
11ANN TOURISM RES59.6161ECON RES-EKON ISTRAZ16.64
12INT J HOSP MANAG59.6162J STRATEG MARK16.64
13J SUSTAIN TOUR52.6863TECHNOVATION16.64
14INT J INDIAN CULT BU51.2964J DESTIN MARK MANAGE16.64
15INT J ELECTRON GOV R49.2165PUB ADMIN INF TECH15.94
16INT J PROD RES49.2166CREAT INNOV MANAG15.25
17BUS STRATEG ENVIRON48.5267ENTERP INF SYST-UK15.25
18COMPUT HUM BEHAV46.0368INT J TOUR RES15.25
19J INTELLECT CAP44.3669J UNIVERS COMPUT SCI15.25
20INFORM SYST MANAGE44.3670EUR MANAG J15.25
21J ENTERP INF MANAG40.9071TOUR ANAL15.25
22INT J ENTREP BEHAV R40.2072INT J INNOV LEARN13.86
23GOV INFORM Q39.7073RESOUR CONSERV RECY13.86
24IND MARKET MANAG38.1274INT MARKET REV13.86
25EUROMED ACAD BUS CON37.4375INT J CONSUM STUD13.86
26INT J MOB COMMUN36.0476EUR MANAG REV13.86
27INT J BANK MARK33.9677INT J RETAIL DISTRIB13.86
28J RETAIL CONSUM SERV33.6678J BUS IND MARK13.86
29TOUR REV33.2779J HOSP TOUR TECHNOL13.86
30BRIT FOOD J33.2780INT J OPER PROD MAN13.86
31INT ENTREP MANAG J31.8881ELECTRON MARK13.17
32TRANSFORM GOV-PEOPLE31.1982EUR J INFORM SYST13.17
33P ANN HICSS30.5083MANAGE DECIS12.95
34EUR J INT MANAG29.1184J SERV MARK12.48
35PSYCHOL MARKET29.1185INT J LOGIST MANAG12.48
36J HOSP TOUR RES27.7386BRIT J MANAGE12.48
37EUR J MARKETING27.7387PUBLIC MANAG REV12.48
38BUS PROCESS MANAG J27.7388INT J EMERG MARK12.48
39TOTAL QUAL MANAG BUS26.3489J PROD BRAND MANAG12.48
40ADV THE PRAC EMER MA26.3490SUSTAINABILITY-BASEL12.37
41J HOSP TOUR MANAG26.3491EUR J INNOV MANAG11.78
42J TRAVEL TOUR MARK26.3492INT J INNOV MANAG11.09
43ROUTLEDGE STUD MARK24.9593FOOD QUAL PREFER11.09
44INNOV TECH KNOWL MAN24.9594IFAC PAPERSONLINE11.09
45INFORM MANAGE-AMSTER23.8895CORP SOC RESP ENV MA11.09
46ASIA PAC J TOUR RES23.5796INT J ENTREP VENTUR11.09
47J INNOV KNOWL22.1897COMPUT IND ENG11.09
48TOUR MANAG PERSPECT20.7998J ORGAN BEHAV11.09
49TOURISM MANAGE20.4399J INT MANAG11.09
50CURR ISSUES TOUR20.10100J INT CONSUM MARK11.09
Table 3.
The 100 journals of higher weight to determine the invisible college of “Quantitative Information Science”
Quantitative Information Science
RankJournal AbbreviationWeightRankJournal AbbreviationWeight
1J INFORMETR366.6751FRONT INFORM TECH EL4.16
2PRO INT CONF SCI INF234.2852ACM-IEEE J CONF DIG4.16
3PROF INFORM99.1253INVESTIG BIBLIOTECOL4.16
4QUANT SCI STUD74.8654CAN J SOCIOL4.16
5RES EVALUAT42.9855ACTA PHYS POL A4.16
6MATH COMPUT MODEL34.6656INT CONF BIG DATA4.16
7J DATA INFO SCI31.1957THEOR CHEM ACC4.16
8J AM SOC INFORM SCI29.1158NAT HUM BEHAV4.16
9SPRINGER HBK29.1159HIGH EDUC4.03
10REV ESP DOC CIENT27.0360INFORM RES4.03
11SCI PUBL POLICY26.3461DATA KNOWL ENG3.47
12ONLINE INFORM REV16.4062LIBRI3.47
13J DOC14.6763HUM SOC SCI COMMUN3.47
14COLLNET J SCIENTOMET14.5664COMUNICAR3.47
15CURR SCI INDIA13.8665RES POLICY3.45
16EMBO REP13.8666ELECTRON LIBR3.16
17LEARN PUBL13.1767J SUPERCOMPUT2.77
18ASLIB J INFORM MANAG12.9568J ORTHOP SURG RES2.77
19P ASIST ANNU11.0969ECON POLIT-ITALY2.77
20J CHEM PHYS11.0970CHIMIA2.77
21ELIFE9.7071DATA2.77
22SCI ENG ETHICS9.7072J LIBR INFORM STUD2.77
23J SCIENTOMETR RES9.7073INT J HEALTH POLICY2.77
24ASLIB PROC9.7074PHYS LETT B2.77
25NATURE9.2175INFORM VISUAL2.77
26CLIMATE8.3276MATDEMATICS-BASEL2.77
27REV ESP SALUD PUBLIC8.3277UNIV PSYCHOL2.77
28J ECON SURV6.9378J EVOL ECON2.77
29BMC BIOINFORMATICS6.9379J ALZHEIMERS DIS2.77
30CRIMINOLOGIE6.9380PAC J MATH2.77
31PRESSE MED6.9381STUD CLASS DATA ANAL2.77
32LIBR INFORM SCI RES6.3382RES HIGH EDUC2.77
33J INTELL FUZZY SYST6.2483ACTA PAEDIATR2.77
34LIBR INFORM SCI SER6.2484ANN I H POINCARE B2.77
35MEAS-INTERDISCIP RES5.5585INT J STROKE2.77
36IEEE INT CON MULTI5.5586ESTUD MENSAJE PERIOD2.77
37CAN J INFORM LIB SCI5.5587PALGR COMMUN2.77
38HIGH EDUC Q5.5588SOC STUD SCI2.77
39Z EVAL5.5589EVALUATION REV2.77
40J ORGANOMET CHEM5.5590PHYS REV LETT2.77
41IEEE INT CONF FUZZY5.5591PSICOTHEMA2.77
42PUBLICATIONS5.5592IEEE INT C BIO BIO W2.77
43J LIBR INF SCI5.5593IETE TECH REV2.77
44MALAYS J LIBR INF SC5.1894MED CLIN-BARCELONA2.77
45COLL RES LIBR4.8595J TECHNOL TRANSFER2.77
46J KOREAN MED SCI4.1696TECHNOVATION2.77
47INT CONF CONTEMP4.1697EDUC XX12.77
48MINERVA4.1698AIP CONF PROC2.77
49KNOWL ORGAN4.1699INFORM POL2.77
50FEMS MICROBIOL LETT4.16100DOC BIBL2.77
Table 4.
The 100 journals of higher weight to determine the invisible college of “Library Science”.
Library Science
RankJournal AbbreviationWeightRankJournal AbbreviationWeight
1INT J GEOGR INF SCI198.2451HEALTH AFFAIR18.02
2J AM MED INFORM ASSN120.6152COLLECT BUILD18.02
3LEARN PUBL113.6853PEDIATRICS17.33
4PROF INFORM92.1954APPL CLIN INFORM17.33
5CHANDOS INF PROF SER83.1855IEEE J-STARS16.64
6J LIBR ADM67.9356NPJ DIGIT MED16.64
7ASLIB PROC58.9257PUBLIC LIBR Q16.64
8J LIBR INF SCI57.5358APPL GEOGR15.25
9REV BELGE PHILOL HIS56.8459AGR FOREST METEOROL15.25
10EDINB STUD CLASS ISL48.5260JAMA-J AM MED ASSOC15.25
11AM HIST REV44.3661MALAYS J LIBR INF SC15.25
12LIBR MANAGE44.3662J DOC14.96
13EM QUESTAO42.9863JAMA NETW OPEN13.86
14J AUST LIB INF ASSOC37.4364GISCI REMOTE SENS13.86
15T GIS37.4365INTERLEND DOC SUPPLY13.86
16LIBR HI TECH36.5466INFORM SOC-ESTUD13.86
17ISPRS INT J GEO-INF34.6667INT J CLIN MONIT COM13.86
18LANDSCAPE URBAN PLAN33.2768DIGIT LIBR PERSPECT13.86
19J GEN INTERN MED31.8869AUST ACAD RES LIBR12.48
20SYNTH REACT INORG M30.5070BMJ QUAL SAF12.48
21COMPUT ENVIRON URBAN29.1171NEW REV ACAD LIBR12.48
22SPECULUM27.7372JOURNALISM12.48
23J ASIAN AFR STUD27.7373NEW LIB WORLD11.09
24HEALTH INFO LIBR J26.3474MUSLIM WORLD AGE CRU11.09
25IFLA J-INT FED LIBR26.3475JAMA INTERN MED11.09
26LIBRI25.6576ANN EMERG MED11.09
27IEEE T GEOSCI REMOTE24.9577LANDSCAPE ECOL11.09
28REV IBERI-AM CIENC I24.9578ISLAM11.09
29GLOB KNOWL MEM COMMU23.5979LECT NOTES GEOINF CA11.09
30INT J REMOTE SENS23.5780HEALTH SERV RES11.09
31AFR J LIBR ARCH INFO23.5781EVID BASED LIB INF P11.09
32QUAL QUANT METHODS L23.5782RDBCI-REV DIG BIB CI11.09
33J ACAD LIBR23.0183PHOTOGRAMM ENG REM S11.09
34LIBR RESOUR TECH SER22.1884INF DISCOV DELIV10.93
35J TRANSP GEOGR20.7985REMOTE SENS ENVIRON10.40
36S AFR J INFORM MANAG20.7986PORTAL-LIBR ACAD10.40
37ANN INTERN MED20.7987ONLINE INFORM REV10.07
38LIBR REV20.7988ASLIB J INFORM MANAG9.78
39B SCH ORIENT AFR ST20.7989COLLECT CURATION9.70
40ANN AM ASSOC GEOGR20.7990CAT CLASSIF Q9.70
41AM J MANAG CARE20.7991SCI CHINA SER D9.70
42INFORM DEV20.7192HEALTHCARE-J DEL SCI9.70
43J INFORM OPTIM SCI19.4193J SCHOLARLY PUBL9.70
44NEW ENGL J MED19.4194SERIALS REV9.70
45ELECTRON LIBR19.2795PERFORM MEAS METR9.70
46S AFR J LIBR INF18.7196JAMIA OPEN9.70
47PERSPECT CIENC INF18.0297BIBLIOS9.70
48ENCONTROS BIBLI18.0298INT J APPL EARTH OBS9.70
49PEDIATR RES18.0299ARCH INTERN MED9.70
50DESIDOC J LIB INF TE18.02100IEEE T NEUR NET LEAR9.70

For example, for the invisible college of “Information Systems”, the academic journals with the highest weights for representing the research output of authors are five (see Table 1): (1) INFORM SYST RES; (2) INFORM SYST J; (3) J MANAGE INFORM SYST; (4) J ASSOC INF SYST; and (5)EUR J INFORM SYST. These five journals publish studies of the highest quality in the field of information systems. Their articles promote knowledge about the design, management, use, assessment and impacts of information technologies by individuals, groups, organizations, societies and nations for the improvement of economic and social well-being. To this end, they integrate technological disciplines with social, contextual and management issues. Distinguished scholars within this invisible college would be, for example, Dennis, Davinson, Chan, Sarker, Grover, Pan, Benitez, and Lowry (see Figure 3). Table A1 (in Appendix) illustrates the complete list of authors in this invisible college of “Information Systems” using journal coupling.

However, for the invisible college of “Business and Information Management”, the academic journals with the highest weights for representing the authors’ research output are six (see Table 2): (1) J BUS RES; (2) J KNOWL MANAG; (3) INT J INFORM MANAGE; (4) PROD PLAN CONTROL; (5)J HOSP MARKET MANAG; and (6) INT J CONTEMP HOSP MANAG. These journals aim to publish research that examines a wide variety of business decision contexts, processes, and activities. These articles also focus on the challenge for information management and the emerging needs of the industry. This includes the management of activities that generate changes in the behavioral patterns of customers, people and organizations, as well as information that leads to changes in the way people use information to participate in knowledge-focused activities. Distinguished scholars within this invisible college of “Business and Information Management” would be, for example, Pandey, Dwivedi, Dennehy, Meissner, Kar, Khare, Pappas, Papadopoulos, and Popa (see Figure 4). Table A2 (in Appendix) illustrates the complete list of authors in this invisible college of “Business and Information Management” using journal coupling.

For the invisible college of “Quantitative Information Science”, the academic journals with the highest weights for representing the research output are four (see Table 3): (1) J INFORMETR; (2) PRO INT CONF SCI INF; (3) PROF INFORM; and (4) QUANT SCI STUD. This group of authors included several scholars who are among the recipients of the Derek de Solla Price Memorial Medal, the first and the most important international prize in scientometric studies, e.g., Bornmann, Thelwall, Waltman, Rousseau, and Egghe (see Figure 5). In this group we also found some of the most significant and influential LIS scholars during the time examined, e.g., Lariviere, Costas, Sugimoto, D’Angelo, Abramo, Wouters, Zahedi, Delgado Lopez-Cozar, Moya-Anegón, and so on (see Figure 5). The academics in this cluster have a high level of production and more links with the rest of the authors who seem to surround them. Table A3 (Appendix) illustrates the complete list of authors in this invisible college of “Quantitative Information Science” using journal coupling.

In Figure 6, we can see how high levels of similarity correspond with similar journal publication profiles: Bornmann’s and Rousseau’s research output is very similar (similarity = 0.99) and highly focused on four main journals (SCIENTOMETRICS, J INFORMETR, J ASSOC INF SCI TECH/J AMSOC INF SCI TEC, PRO INT CONF SCI INF) which contain more than 64% of the total production for both authors. In the same Figure 6, we found that the similarity between Bornmann and Stuart is much smaller (similarity = 0.66). However, for both authors, seven journals contain more than 43% of their production (see Figure 6).

Figure 6.

Detail of journal similarities among Bornmann, Rousseau, and Stuart according to the IS&LS category of the Web of Science Core Collection.

For the invisible college of “Library Science”, the academic journals with the highest weights for representing the authors’ research output are eight (see Table 4): (1) INT J GEOGR INF SCI; (2) J AM MED INFORM ASSN; (3) LEARN PUBL; (4) PROF INFORM; (5) CHANDOS INF PROF SER; (6) J LIBR ADM; (7) ASLIB PROC; and (8) J LIBR INF SCI. These journals publish articles that reflect all aspects of library and information science. They focus on the practices, perspectives and tools of management, information technology, education and other areas of libraries. This includes the collection, organization, preservation and dissemination of information resources. They also cover the political economy of information, as well as the design, implementation and use of geographic information, medical in-formation and other special libraries for monitoring, prediction and decision making. Distinguished scholars within this invisible college of ‘Library Science’would be, for example, Zipf, Ho, Nicholas, Yuvaraj, Holley, Verma, Jamali, Rodriguez-Bravo, and Boukacem-Zeghmouri (see Figure 7). Table A4 (In Appendix) illustrates the complete list of authors in this invisible college of “Library Science” using journal coupling.

Figure 7.

Visualization in VOSviewer of the “Library Science” invisible college.

For the invisible college of library science, Figure 8 illustrates some details of journal similarities among Batool, Warraich, and Onyancha according to the IS&LS category of the Web of Science Core Collection. We can see how high and low levels of similarity correspond with certain journal publication profiles. For example, Batool’s and Warraich’s research output is similar (similarity = 0.93) and highly focused on seven main journals which contain 53% and 45% of the total production, respectively. In the same Figure 8, we found that the similarity between Batool and Onyancha is smaller (similarity = 0.63). However, only five journals still contain 44% and 38% of their production, respectively (see Figure 8).

Figure 8.

Detail of journal similarities among Batool, Warraich, and Onyancha according to the IS&LS category of the Web of Science Core Collection.

Figure 9 illustrates the map of invisible colleges according to their journal publication profile in IS and LS. This map is a node-edge diagram, which places each invisible college on the plane and links pairs of invisible colleges using their second-order similarities. On one hand, the invisible college “Quantitative Information Science” has the highest production by author (represented by node size) and a moderately strong link to Library Science. On the other hand, Information Systems and Business & Information Management are (relatively) less productive invisible colleges that show similarities with each other in certain fields of activity. They have weak links to Quantitative Information Science and Library Science (see Figure 9).

Figure 9.

Map of invisible colleges according to their journal publication profile in IS and LS.

However, what are the academic journals that demonstrate that IS and LS determine a single field of research, since authors from all the invisible colleges publish in these journals? Are there academic journals of this type and what are they? Table 5 shows the list of academic journals in which authors from all the invisible colleges have published. In this table we see that authors from all invisible colleges have published in Plos One and Lect Notes Compt Sc. Academic journals such as Expert Syst Appl, Internet Res, Technol Forecast Soc, or Telemat Inform also appear on this list. But what are the specific journals of the research area in which authors from all the invisible colleges publish? There are seven main journals that stand out in this regard and they are J Am Soc Inf Sci Tec/J Assoc Inf Sci Tech, Scientometrics, Int J Inform Manage, Inform Process Manag, Inform Technol Peopl, Inform Syst Front, and J Inf Sci (see Table 5). These journals tell us about the existence of a unique research field, which appears to encompass several subfields of research (i.e., the detected invisible colleges).

Table 5.
Academic journals in which authors from all the invisible colleges have published.
Journal AbbreviationInform SystBus & Inform ManagQuant Inform SciLibr Sci
ADV MATER RES-SWITZ73210
BEHAV INFORM TECHNOL203912
COMM COM INF SC49814
COMMUN ACM671212
COMMUN ASSOC INF SYS1232218
COMPUT EDUC21313
ENVIRON SCI POLLUT R15120
EXPERT SYST APPL282328
IEEE ACCESS18311
IEEE T ENG MANAGE477213
INFORM PROCESS MANAG1057942
INFORM SCIENCES23212
INFORM SYST FRONT25207111
INFORM TECHNOL MANAG1721
INFORM TECHNOL PEOPL357937
INT J ENV RES PUB HE57215
INT J HUM-COMPUT INT22823
INT J INFORM MANAGE107366334
INT J MED INFORM24127
INTERNET RES558148
J AM SOC INF SCI TEC20725218
J ASSOC INF SCI TECH91025635
J CLEAN PROD2781116
J COMPUT INFORM SYST106515
J COMPUT-MEDIAT COMM1337
J ENVIRON MANAGE27134
J INF SCI356166
J MED INTERNET RES910324
LECT NOTES ARTIF INT33410
LECT NOTES COMPUT SC131582667
NEW MEDIA SOC15312
P NATL ACAD SCI USA3147
PLOS ONE61212472
SCI REP-UK36519
SCIENTOMETRICS2784641
TECHNOL FORECAST SOC524976
TELEMAT INFORM374317
TOURISM MANAGE171121
5
Discussion and conclusions

Our results based on journal coupling show four main invisible colleges in LIS (i.e., Library Science, Quantitative Information Science, Information Systems, and Business and Information Management). These results are consistent with those obtained in (Astrom, 2010) using co-citation maps, as well as the co-occurrence of shared references between IS and LS authors. Using citation data from publications in LIS journals, Astrom (2010) found that IS and LS determine two main subfields in a joint field of LIS, where a further division into more specialized research areas can also be found when analyzing data from IS citations. On the contrary, LS research areas were less visible in citation analyses. Thus, what are the academic journals that determine the existence of subfields in LIS? They are journals in which authors from a certain invisible college publish very frequently and in which the authors from other colleges have never published. For example, in the invisible college of Information Systems, there are four journals that stand out and determine a subfield of research due to their weight in the discrimination of IS and LS authors. They are Inform Syst Res, Inform Syst J, J Manage Inform Syst, and Eur J Inform Syst (see Table 1).

In the invisible college of Business and Information Management, there are two main academic journals that determine the existence of this research subfield, due to the frequency of publication in them (see Table 2): J Bus Res and J Knowl Manage.

In the invisible college where the most significant and influential LIS academics were found (Quantitative Information Science), the journals with the highest weights when it comes to discriminating this research subfield are fundamentally two (see Table 3): J Informetr and Pro Int Conf Sci Inf. Furthermore, Prof Inform and Quant Sci Stud also played a prominent role, the latter being a recently created journal (see Table 3).

Finally, three academic journals are the most important in determining the invisible college of Library Science, specifically, Int J Geogr Inf Sci, J Am Med Inform Assn, and Learn Publ (see Table 4). This tells us about the importance that special libraries have in this subfield, more precisely, geographic information and medical information.

Our results regarding the most relevant journals to determine the structure of information science and library science, are consistent with those of previous studies. For example, Vinkler (2019) identified among the core journals in scientometrics the following journals: Scientometrics; Journal of the American Society for Information Science and Technology; Information Processing and Management; Journal of Information Science; Research Policy; Library Trends; and Research Evaluation. Our results on the most important journals for determining the structure of the research area also coincide with those presented in (Nixon, 2014; Walters & Wilder, 2015; Weerasinghe, 2017). In a more recent paper, Safón and Docampo (2023) found that “the top 10 journals that most influenced papers published between 2021 and March 2022 were: Scientometrics; International Journal of Information Management; Journal of the Association for Information Science and Technology; Quantitative Science Studies; MIS Quarterly; Information and Management; Information Processing and Management; Journal of the Association for Information Systems; Journal of Informetrics; Journal of Academic Librarianship.” All these journals are, according to our approach based on journal coupling, of great importance either to define LIS as a field of research (see Table 5) or to determine the structure of IS and LS into invisible colleges (see Tables 1-4).

However, our approach and the results obtained not only inform the structure of information science and library science, but also provide an adaptable methodology that can be used in other areas of research and extended to answer additional research questions. For example, to the problem of finding potential reviewers in complex evaluation processes. Ultimately, this paper served as a proof of concept for finding invisible colleges in a research area, using journal coupling. Regarding limitations and further research, a first limitation of our study is that the results shown in this paper are from a controlled exercise. We analyzed 302 authors who published in the IS&LS category of the Web of Science Core Collection. In future work, we will explore alternative approaches to identifying the authors that will be used to determine the structure of the research area, and compare the results with those reported here.

Second, the analysis performed using journal coupling excludes books, book chapters, and conference papers. This is a significant omission, since contributions other than articles remain important within information science and library science. This could be an interesting point of analysis for future work.

Third, although related to the previous limitation, in this paper, academic journals alone were used for research output representation. However, as suggested in (Garcia et al., 2012), “sets of journals cannot provide complete identifications of research output.” In future work, we will explore the incorporation of more complex entities for author representation.

DOI: https://doi.org/10.2478/jdis-2025-0006 | Journal eISSN: 2543-683X | Journal ISSN: 2096-157X
Language: English
Page range: 101 - 131
Submitted on: Sep 11, 2024
Accepted on: Nov 19, 2024
Published on: Dec 11, 2024
Published by: Chinese Academy of Sciences, National Science Library
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2024 Jose A. Garcia, Rosa Rodriguez-Sanchez, J. Fdez-Valdivia, published by Chinese Academy of Sciences, National Science Library
This work is licensed under the Creative Commons Attribution 4.0 License.