This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: Corresponding author: Xiangjie Kong; 11email: [email protected]

J. Liu
22institutetext: F. Xia 33institutetext: X. Kong44institutetext: Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, School of Software, Dalian University of Technology, China 55institutetext: T. Tang 66institutetext: Chengdu College, University of Electronic Science and Technology of China, China 77institutetext: A. Tolba 88institutetext: Z. AL-Makhadmeh99institutetext: Computer Science Department, Community College, King Saud University, Saudi Arabia 1010institutetext: A. Tolba1111institutetext: Mathematics Department, Faculty of Science, Menoufia University, Egypt

Understanding the Advisor-advisee Relationship via Scholarly Data Analysis

Jiaying Liu    Tao Tang    Xiangjie Kong    Amr Tolba    Zafer AL-Makhadmeh    Feng Xia
(Received: date / Accepted: date)
Abstract

Advisor-advisee relationship is important in academic networks due to its universality and necessity. Despite the increasing desire to analyze the career of newcomers, however, the outcomes of different collaboration patterns between advisors and advisees remain unknown. The purpose of this paper is to find out the correlation between advisors’ academic characteristics and advisees’ academic performance in Computer Science. Employing both quantitative and qualitative analysis, we find that with the increase of advisors’ academic age, advisees’ performance experiences an initial growth, follows a sustaining stage, and finally ends up with a declining trend. We also discover the phenomenon that accomplished advisors can bring up skilled advisees. We explore the conclusion from two aspects: (1) Advisees mentored by advisors with high academic level have better academic performance than the rest; (2) Advisors with high academic level can raise their advisees’ h-index ranking. This work provides new insights on promoting our understanding of the relationship between advisors’ academic characteristics and advisees’ performance, as well as on advisor choosing.

Keywords:
Academic networks Scholarly data Social network analysis Advisor-advisee relationship Collaboration patterns
journal: Scientometrics

1 Introduction

Scholarly data analytics has become a vital part of the scientific research (Letchford et al., 2015; Lehmann et al., 2006; Glänzel et al., 2017). Relationship extraction is a major area of interest in the scholarly social network analysis. The structure of the network usually contains a large number of nodes, with different relationships such as friendship, kinship, and hostility. Analyzing such relations can strengthen our understanding of the evolution and development of the scholarly society (Xia et al., 2017). One of the main relations is the mentorship because mentors are important for their protégés. Researchers have studied various aspects of mentorship including types, phases, outcomes and mobilities as early as 30 years ago (Chao et al., 1992; Kram, 1983). Although there is little doubt that the study of academic mentorship is necessary, it has been neglected because there is no complete advisor-advisee dataset. Each scholar needs guidance from the advisor when she/he enters academia as a newcomer. It is widely accepted that different advisors influence advisees differently (Murphy, 2015). However, where these differences are originated and how significant they are, both are worth exploring. The lack of quantitative academic mentorship analysis leads to the striking lack of specific guiding significance for advisors choosing.

The mentorship is universal and complex. Traditionally, a mentorship is defined as “A relationship between the inexperienced mentee and an experienced senior member of a particular field” (Dobson, 2013). It has different types and forms in different fields. For instance, it can be regarded as the relationship between employees and managers in public utility companies and advisor-advisee relationship in academia. There is a large volume of studies describing mentorship in various aspects. According to the formation of the relationship, Chao et al. (1992) first categorized mentorship into two types: formal mentorship and informal mentorship. Informal mentorship usually has neither compact structure with reasonable management, nor been recognized by the organization formally. In contrast to informal mentorship, formal mentorship occurs with external intervention from the organization. Usually it is led and managed by the organization. Mentorship is a temporary relationship with almost fixed duration, for example, the average length of advisor-advisee relationship in academic networks is five years (Kram, 1983). The phases of the mentorship can be defined as initiation, cultivation, separation, and redefinition on the basis of the particularly effective experience (Kram, 1983). In recent years, there has been an increasing amount of literature describing specific elements of mentorship, including behaviors of mentors and protégés, mentoring functions, structure of cooperation in the mentorship (Johnson and Ridley, 2015; Bozionelos et al., 2014; Borders et al., 2012), and so on. These studies provide complete insights into the development and implementation of the mentorship.

No matter what the form is, the ultimate goal of mentorship is to enhance the personal and professional development of protégés (Chao, 1997). Therefore, mentors have been taken as beacon lights in the development of protégés. Based on inherently dyadic attributes of the mentorship, the existing literature demonstrates that mentorship is positive and beneficial for both mentors and protégés. In a positive mentorship, the protégés will have better performance and more opportunities for promotion because their mentors provide them with personal and career assistance. Meanwhile, the mentors can receive fulfillment and gain satisfaction by improving protégés’ welfare (Ghosh and Reio, 2013; Hu et al., 2014). Furthermore, organizations benefit as well because there is a real possibility that protégés would like to be devoted to their organizations after graduation (Malmgren et al., 2010; Florea et al., 2013). Moreover, as early as 1989, Fagenson (1989) suggested that mentoring can enhance work validity and the career success. Singh et al. (2009) perceived to the point that compared with those who do not have a mentor, individuals who have obtain mentors embrace more chances to promote in their careers, and more likely to consider their future prospects ahead of time in more positive ways. Wanberg et al. (2006) proposed sometimes a mentor may change a protégé’s vocational tendency which can lead the protégé to the different lifeline.

The recent trends in mentorship analysis have led to a proliferation of studies focusing on the association between mentorship and the career development of protégés. Since Kram (1983) presented the psychosocial functions and career development functions to examine the mentors’ influence on the protégés, almost every paper focusing on the outcome of mentorship includes contents related to it. The mentoring functions have received considerable critical attention. On this basis, Chao et al. (1992) compared the mentoring functions among formal mentorships, informal mentorships, and non-mentored counterparts. Scandura and Ragins (1993) investigated the link between mentoring functions and the career mobility outcomes in terms of promotions and salaries. In a study conducted by Singh et al. (2009), it showed the correlation of rising star attributes measured at different periods, i.e., no-mentored period and a year after mentored. In the same vein, Chao (1997) examined linkages among different mentorship phases, functions and outcomes of protégés in order to integrate different aspects of mentoring into a more comprehensive theory. According to Young and Perrewe (2000), there were specific behaviors such as trust, related to career and social support the exhibited throughout the mentoring process. Furthermore,  Tuesta et al. (2015) found the evidence that there exists a positive correlation between time of advisor-advisee relationship and advisee’s productivity in the area of Exact and Earth Sciences.

Although research has been carried out on the mentors’ influence, with the exception of  Tuesta et al. (2015) who measured scientific productivity by the number of publications in journals, all others (Chao et al., 1992; Scandura and Ragins, 1993; Singh et al., 2009; Young and Perrewe, 2000; Chao, 1997) remain narrowly focus on dealing only with analytic study based on the feedback from the questionnaires. There are few data-based investigations studying the outcomes of choosing different advisors in academia. This gives rise to the question of how to choose an advisor when newcomers enter academia. In response to the question “What kind of advisors you will choose when you decide to pursue the PhD degree?”, most of the answers are non-specific and all-embracing. Almost all advisees want to choose advisors who make a great and positive influence on themselves. Young advisees in academia often seek to work in collaboration with top advisors in their field in pursuit of a successful career. However, “top advisors” in academia can be defined differently.

In this work, we analyze the relationship between advisees’ performance (i.e., productivity and impact) and advisors’ academic characteristics from the perspective of Computer Science. It is different from the existing research which only employs questionnaires. The dataset of advisor-advisee relationships is extracted from Digital Bibliography & Library Project (DBLP) dataset (Ley, 2009). We use the improved stacked autoencoder method based on the deep learning with the highest accuracy reaching up to 94% to get the complete dataset (Wang et al., 2016, 2017). Then we make a quantitative analysis of the dataset. We calculate the productivity and impact of advisees including the number of publications, citations, and h-indices in 1-12 years after first collaborating with their advisors. We separate the advisors into groups according to their academic age to observe the relation between advisees’ outcomes and advisors’ academic ages. We find that advisees’ number of publications, citations, and h-indices follow the same trend with the increasing of advisors’ academic ages, exhibiting an initial growth, remaining stationary for a duration, finally with a declining trend. Based on the observation, we examine the correlation between the ranking of advisors and their advisees’ h-indices.

Furthermore, by combining quantitative analysis with qualitative analysis, we find that advisors with high academic levels will bring up advisees with high academic performance. We conclude the findings from two main aspects: (1) Advisees mentored by the advisors with high academic level generally have better academic performance than the rest. We divide the advisors into two groups: one with a high ranking in terms of publications/citations/h-indices and the other without. By comparing the academic performance of all advisees mentored by different advisors, we observe that the phenomenon also holds for advisees with high academic level (if a scholar ranks the top 10% in the number of publications/citations/h-indices, then she/he has the higher academic achievement). (2) Advisors with high academic level can increase the probability of their advisees ranking the top 10% in h-index. We calculate the probability of an advisee ranking the top 10% in different cases. These cases are based on advisors’ different h-indices and academic ages. The outcomes of this quantitative research, which compare advisees’ achievements coached by advisors with the different academic performance we present here, can be utilized for advisor recommendation.

2 Methods

2.1 Dataset

Our large-scale dataset of advisor-advisee relationships is generated by applying a deep learning technique, stacked autoencoder, on the DBLP dataset (Wang et al., 2016, 2017). The specific steps for the advisor-advisee dataset acquisition are as follows:

  1. 1.

    Construct the collaboration ego-network for all scientists in the DBLP dataset.

  2. 2.

    Clean the network: Following Sinatra’s (Sinatra et al., 2016) steps, we only choose the scholars who have published at least 5 papers and who have a publication career span of at least 10 years in the DBLP dataset.

  3. 3.

    Extract the required features for training as unlabeled input of the model: Compute the personal properties (i.e., academic age, number of publications) and collaboration properties (i.e., collaboration times, collaboration duration, times of first-two authors, and cohesion of collaboration) for each scholar in the first 8 years of collaboration, then use these normalized features as the input of the model. The Back Propagation (BP) method is used to train the model and optimize the model. The result of identifying advisors is obtained through classifier after training.

  4. 4.

    Select the advisor-advisee pair whose identification accuracy exceeds 90% as the advisor-advisee dataset.

The output dataset consists of 810,469 advisor-advisee pairs and more than 1,475,000 publications from 1968 to 2016, with the highest computational precision reaching 94%. While DBLP only provides publication details for each scholar, the accomplishment in terms of citations cannot be obtained from DBLP. In order to obtain scholars’ citations and h-indices, we match our dataset with AMiner-DBLP (Tang et al., 2008). AMiner 111https://www.aminer.cn is a website which aims to provide comprehensive search and mining services for researcher social networks. It is considered as a widely used and one of the best-curated databases for Computer Science articles (Gollapalli et al., 2011; Moreira et al., 2011; Tang et al., 2008; Amjad et al., 2017). Currently, the system consists of more than 6,000 conferences, 3,200,000 publications, and 700,000 researcher profiles before 2016. In this website, developers use Microsoft Graph Search API to query each AMiner paper’s title and obtain candidate matching papers for each AMiner paper. For the computational accuracy, they randomly sampled 100,000 linking pairs and evaluated the matching accuracy. The number of truly matching pairs is 99,699 and the matching accuracy can achieve 99.70%.

Finally, the analysis is on the grounds of 15,559 advisors whose academic age is 5 (23,473 advisees), 20,859 advisors whose academic age is 10 (36,841 advisees), 17,028 advisors whose academic age is 15 (33,652 advisees), 11,522 advisors whose academic age is 20 (24,883 advisees), and 7,352 advisors whose academic age is 25 (11,305 advisees). The whole process of the experiment is illustrated in Fig. 1.

Refer to caption
Figure 1: Dataset acquisition process.

2.2 Ranking the scholars

We take three variables into consideration to rank a scholar: number of publications, citations, and h-indices. H-index of a scholar is calculated on the basis of the year of publications and referenced information each year. A scholar has an h-index hh if at least hh of his/her publications attract hh or more citations. In order to measure the advisors’ influence on advisees during different collaboration periods, the indicators for each year are calculated.

2.3 Analysis method

2.3.1 Academic age of scholars

When dividing the type of advisors, we take advisors’ academic ages into consideration because the academic characteristics such as citations and the number of publications are accumulated over time. Therefore, advisors with different academic ages should not be put together for comparison.

Definition 1

AA is defined as the academic age of scholars, i.e.,

AA=YcYf\ AA=Y_{c}-Y_{f}

where YfY_{f} is the year scholars published the first article and YcY_{c} is the investigated year.

In this paper, we calculate advisees’ and advisors’ academic ages of the year in which they began cooperating with each other. In other words, YcY_{c} is the year when they commenced collaboration.

2.3.2 The probability of advisees whose h-indices rank the top 10%

In order to observe the association between advisees’ academic performance and advisors’ academic level, we calculate the possibility that advisees mentored by different types of advisors with different h-index rankings and academic ages. The steps are described as follows:
(1) Divide the advisors according to the academic age when they first collaborated with their advisees. For advisors with the same academic age, divide them into Top10Top_{10} group in which advisors’ h-index ranking is in the top 10% and ResRes group for the rest.
(2) Calculate the total number of Top10Top_{10} advisors’ advisees NttNt_{t} and ResRes advisors’ advisees NtrNt_{r}.
(3) Rank all the advisees with the same academic age according to their h-indices.
(4) Calculate the minimum of the top 10% advisees’ h-index hminh_{min}.
(5) Calculate the number of Top10Top_{10} advisors’ advisees NhtNh_{t} whose h-indices are above hminh_{min}. In the same way, calculate NhrNh_{r} of ResRes advisors.
(6) The probability of Top10Top_{10} advisors’ advisees whose h-index rank top 10% can be calculated as Nht/NttNh_{t}/Nt_{t} and Nhr/NtrNh_{r}/Nt_{r} for ResRes advisors.

3 Results

In this section, we present the basic analysis of the advisor-advisee dataset in subsection 3.1. Subsection 3.2 elaborates the correlation between advisors’ academic characteristics and advisees’ academic performance. Subsection 3.3 shows the phenomenon that an accomplished advisor could bring up a skilled advisee. Finally, the analysis of the increasing publication rate is highlighted in subsection 3.4.

3.1 Statistical analysis of the advisor-advisee relationships

Table 1 summarizes the statistics of the advisors at the beginning of collaborating with their advisees. It illustrates the mean of advisors’ academic characteristics including the number of publications (NPNP), citations (NCNC), and h-indices (NHNH). Here we separate advisors according to their exact academic age (AAAA) when they begin guiding the advisees, to mitigate the bias of accumulated citations for advisors with different AAAA.

Table 1: Academic characteristics of advisors
Attributes AA=5AA=5 AA=10AA=10 AA=15AA=15 AA=20AA=20 AA=25AA=25 Average
NPNP 12.21 17.65 12.01 10.22 9.91 11.73
NCNC 57.11 145.2 195.05 272.74 319 160.4
NHNH 3.13 5.29 5.93 6.84 7.25 4.88

In order to carry out the data-based analysis of the advisor-advisee relationships, we keep a tally of relevant information of advisors and advisees in the dataset. Fig. 2 shows the range of each personal characteristic and number of advisors/advisees distribution of these characteristics.

Refer to caption
(a) Academic ages
Refer to caption
(b) Survival function for AAAA
Refer to caption
(c) Number of publications
Refer to caption
(d) Survival function for NPNP
Refer to caption
(e) Citations
Refer to caption
(f) Survival function for NCNC
Refer to caption
(g) H-indices
Refer to caption
(h) Survival function for NHNH
Figure 2: Academic performance distribution and survival function for advisors and advisees. The horizontal axis represents (a) academic ages, (c) the number of publications, (e) citations, and (g) h-indices, respectively. The vertical axis represents the number of scholars. In (b), (d), (f), and (h), the vertical axis represents the surviving proportion.

The red points represent advisors and blue ones represent advisees. Fig. 2(a) shows the academic age distribution of advisors and advisees since their first collaboration. As shown in Fig. 2(a), most advisors’ AAAA are 5-20 years. After that, as AAAA increases, the number of advisors decreases. Fig. 2(c), Fig. 2(e), and Fig. 2(g), respectively illustrate the distributions of advisees (10 years after first collaborating with their advisors) and advisors (when they began mentoring a certain advisee) in terms of NPNP, NCNC, and NHNH. We only consider the advisees whose career spans at least 10 years since they first collaborated with their advisors. It can be found from Fig. 2(c), Fig. 2(e), and Fig. 2(g), that there is a clear decreasing trend in the number of scholars with the increasing of NPNP, NCNC, and NHNH. We also plot the survival function, that is P(x>X)P(x>X) for these graphs, which will get rid of the noise and observe the differences between various groups. The results are presented in Fig. 2(b), Fig. 2(d), Fig. 2(f), and Fig. 2(h). In these figures, the horizontal axis represents scholars’ AAAA, NPNP, NCNC, and NHNH, respectively. The vertical axis is the survival rate. These survival curves have a declining trend. The steeper decline slope is on behalf of the lower survival rate. From these figures, we can see that all curves are concave. It illustrates that most scholars have relatively lower academic performance, few scholars have higher academic achievements, which can be evidenced by “Long Tail Effect” (Anderson, 2006).

Table 2: Exponential fitting of AAAA, NPNP, NCNC, and NHNH distribution for advisors and advisees
Attributes Fitting Function
Advisees’ AAAA y=y0+AeR0xy=y_{0}+Ae^{R_{0}x}
Advisors’ AAAA y=AeAXy=Ae^{-AX}
Advisees’ NPNP y=aebxy=ae^{bx}
Advisors’ NPNP y=y0+A1ext1y=y_{0}+A_{1}e^{\frac{x}{t_{1}}}
Advisees’ NCNC y=y0+Awπ2e2(xxc)2w2y=y_{0}+\frac{A}{w\sqrt{\frac{\pi}{2}}}e^{-2\frac{(x-x_{c})^{2}}{w^{2}}}
Advisors’ NCNC y=y0+A1ext1+A2ext2+A3ext3y=y_{0}+A_{1}e^{\frac{x}{t_{1}}}+A_{2}e^{\frac{x}{t_{2}}}+A_{3}e^{\frac{x}{t_{3}}}
Advisees’ NHNH y=y0+Awπ2e2(xxc)2w2y=y_{0}+\frac{A}{w\sqrt{\frac{\pi}{2}}}e^{-2\frac{(x-x_{c})^{2}}{w^{2}}}
Advisors’ NHNH y=y0+A1ext1y=y_{0}+A_{1}e^{\frac{x}{t_{1}}}
Refer to caption
(a) AAAA of advisees
Refer to caption
(b) AAAA of advisors
Refer to caption
(c) NPNP of advisees
Refer to caption
(d) NPNP of advisors
Refer to caption
(e) NCNC of advisees
Refer to caption
(f) NCNC of advisors
Refer to caption
(g) NHNH of advisees
Refer to caption
(h) NHNH of advisors
Figure 3: Fitted curves plot for AAAA, NPNP, NCNC, and NHNH. The horizontal axis represents (a) advisees’ AAAA, (b) advisors’ AAAA, (c) advisees’ NPNP, (d) advisors’ NPNP, (e) advisees’ NCNC, (f) advisors’ NCNC, (g) advisees’ NHNH, and (h) advisors’ NHNH, respectively. The vertical axis represents the number of scholars.

Graphs in Fig. 3 are the results obtained from fitting methods for Fig. 2. Specific fitting functions are summarised in Table 2. In Table 2, yy represents the number of advisees/advisors and xx represents AAAA/NPNP/NCNC/NHNH. In order to carry out effective fitting, we use different exponential functions to determine the relationship. In fact, the existing literature (Newman, 2001) has pointed out that the degree distribution of the scientists’ networks is between the exponential and the power-law. Obviously, our results are in consistent with aforementioned law.

3.2 Correlation between advisors’ academic characteristics and advisees’ academic performance

Fig. 4 shows the relationship between the advisors’ academic characteristics and advisees’ academic productivity as well as their impact. Academic productivity can be represented by the number of publications authored by one scholar, and academic impact can be approximated by a scholar’s h-index (Azoulay and Pierre, 2012; Owens, 2013). It is a comprehensive indicator to evaluate scholars. Fig. 4(a) shows the correlation between advisors’ AAAA and average NPNP, NCNC, and NHNH of their advisees 10 years after first collaborating with their advisors. It can be seen that the indicators have the same trend with the increasing of AAAA because these indicators have a certain dependence. All of them experience an initial growth, then remain stationary, and finally reach a decline phase. When advisors’ academic ages are in the range 1-9, advisees’ academic performance shows a linear growth trend. When advisors’ academic age reaches 28, advisees’ performance shows a downward trend. In the period when advisors’ academic age is between 10 and 27, advisees’ average achievements reach the highest level and basically remain unchanged.

It is interesting to consider the possible factors underlying this phenomenon. Prior studies have noted the importance of advisors’ career coaching and social support in advisees’ development. The current study (Wang et al., 2017) finds that the productivity of scholars is dynamic over the course of their own scientific career. For scholars whose AA<12AA<12, the annual productivity rises up slowly. If their AA>24AA>24, their productivity will decline slightly with the AAAA. In the range of 12<AA<2412<AA<24, their annual productivity are nearly fixed. Advisees’ average productivity/impact shows a linear growth trend because their advisors’ productivity in the range 1AA91\leq AA\leq 9 increases linearly. This reflects that the relationship between advisors’ career phases and advisees’ outcomes in terms of productivity and impact actually exists in the scientific mentorship. Our analyses indicate the relationship between collaborations with different AAAA advisors and the future career performance of advisees thus need deeper exploration.

Here are some possible reasons for our findings: (a) Academic research is a long-term accumulation process. Junior advisors need time to accumulate experience in conducting research and mentoring advisees. Compared with senior ones, their capacity and resources are limited. Senior advisors have cumulative advantages, both in academic performance and resources. (b) When the academic age of advisors is at the intermediate level (in the range of 10-27 years), their academic capability may reach a certain level as well. With sufficient resources, they can provide their advisees with the most helpful guidance. In addition to that, they are also willing to strive to instruct their advisees, because their own academic performance can be improved in this way. (c) Few senior advisors have a relatively long academic career (as shown in Fig. 2(a)), that’s why senior advisors of academic age above 35 have poor performing advisees. The random fluctuation may cause this phenomenon, and thus further study is required to untangle the underlying reasons. In general, the correlation between advisees’ high performance and advisors’ academic ages is certainly reasonable, but it is not the causation of scholars’ success. Hard work and passion are still essential to scientists’ success in the scientific career.

Refer to caption
(a) Academic ages
Refer to caption
(b) H-indices
Figure 4: Relationship between the advisors’ academic characteristics and advisees’ academic performance for 10 years since collaborating with their advisors. The horizontal axis represents (a) advisors’ academic age, (b) h-indices. In the subgraphs, the left vertical axis represents advisees’ average number of publications and citations. The right vertical axis represents their average h-index.

Similarly, in Fig. 4(b), the abscissa represents the advisors’ NHNH. In this subgraph, the left vertical axis represents advisees’ average NPNP, NCNC and right axis represents their average NHNH. Comparing with advisees’ NCNC and NHNH, advisees’ NPNP has the least obvious trend with the increase of advisors’ NHNH. The phenomenon elicits the truth that quantity is not equal to quality especially in academia. The maximum value of advisors’ h-index we have considered is 25 because there are few advisors with high NHNH values (see Fig. 2). The value of indicators for advisees’ fluctuates widely along with the growth of abscissa. So the data is binned in order to make the trend more clear.

3.3 An accomplished advisor brings up a skilled advisee

Refer to caption
(a) 5
Refer to caption
(b) 10
Refer to caption
(c) 15
Refer to caption
(d) 20
Refer to caption
(e) 25
Figure 5: Advisees’ h-indices whose advisors’ academic ages are (a) 5, (b) 10, (c) 15, (d) 20, (e) 25, as well as their h-indices are ranking at the top 5%, 5%-10%, 10%-15%, 15%-20%, 20%-25%. The polylines represent the trends of all advisees’ average h-index over time.

According to the influence of advisors’ academic ages on advisees, Fig. 5 indicates advisees’ h-indices whose advisors’ academic ages are 5, 10, 15, 20, 25, and the ranking of their h-indices begins at the top 5%, 5%-10%, 10%-15%, 15%-20%, 20%-25%, respectively. The polylines represent the trends of all advisees’ average h-index over time. It can be seen from the data in Fig. 5, advisors with different rankings in h-index can bring their advisees different h-indices. It is commonly perceived that high-achieving advisors literally cause their advisees higher academic productivity and impact. Selective bias is one possible reason as top advisors tend to select quality students. It is interesting to observe that as the advisors’ academic age rises, the gap among advisees narrows.

Refer to caption
(a) Number of publications
Refer to caption
(b) Publications’ difference
Refer to caption
(c) Citations
Refer to caption
(d) Citations’ difference
Refer to caption
(e) H-indices
Refer to caption
(f) H-indices’ difference
Figure 6: Average academic performance of all advisees supervised by Top10Top_{10} advisors and ResRes advisors. TnT_{n} corresponds to Top10Top_{10} advisors whose academic ages are nn. RnR_{n} corresponds to the rest whose academic ages are nn. (a), (c), and (e) show the advisees’ number of publications, citations and h-indices coached by different advisors, respectively. Details of their statistical mean and deviation are shown in (b), (d), and (f), respectively.

The advisees supervised by advisors whose h-index rank the top 10% always have the highest h-index. But it is not clear how significant the differences among these advisees are. On the basis of the definition of scholars’ impact, here we regard the top ten percent of advisors in h-index as Top10Top_{10} advisors. ResRes advisors represent the rest of advisors. We try to find out the differences between advisees mentored by Top10Top_{10} advisors or ResRes advisors. We convert the problem from a qualitative analysis to the quantitative analysis. Fig. 6 reflects advisees’ average NPNP, NCNC, and NHNH over time since advisees first collaborated with their advisors. In Fig. 6, advisors are differentiated with academic ages and h-indices. In the figures, TnT_{n} corresponds to Top10Top_{10} advisors whose academic ages are nn while RnR_{n} corresponds to ResRes advisors whose academic ages are nn. From the figures, we can see that advisors’ NPNP results in the least obvious discrepancy between advisees supervised by Top10Top_{10} advisors and ResRes advisors. The most striking observation to emerge from the data comparison is the growth rate of each indicator. Advisees mentored by Top10Top_{10} advisors have higher NPNP, NCNC, and NHNH than the others. Moreover, their growth rate is higher. This result may be explained by the Matthew Effect (Langfeldt et al., 2015). In academia, while excellent scholars’ accomplishments and reputation tend to snowball, those with modest accomplishments have greater difficulty to improve their impact.

In Fig. 7, similarly, which is consistent with assessment standards of advisors, the advisees whose h-index rank the top 10% of each indicator are considered as the Top10Top_{10} advisees. The differences between Top10Top_{10} advisees’ NPNP, NCNC, and NHNH, who are mentored by Top10Top_{10} advisors or ResRes advisors are highlighted in Fig. 7(d), Fig. 7(e), and Fig. 7(f). To the same extent as the previous results, comparing with NCNC and NHNH, the discrepancy between advisees’ average NPNP mentored by Top10Top_{10} advisors or ResRes advisors is not obvious. However, considering academic ages of the advisors, Top10Top_{10} advisees mentored by Top10Top_{10} advisors whose academic age is 30 have the lowest average NHNH. For ResRes advisors, the same scenario occurs to advisors whose academic ages are 5. It is different from advisors’ NCNC and NHNH. This phenomenon gives us inspiration that only considering the number of publications to evaluate scholars is insufficient because quantity is not always equal to quality. As a result, a number of comprehensive indices have appeared to evaluate scholars. In Fig. 6 and Fig. 7, it can be seen from the subgraphs that comparing with all of the advisees, Top10Top_{10} advisees mentored by Top10Top_{10} advisors or ResRes advisors make more distinct values in the average NPNP, NCNC, and NHNH.

Refer to caption
(a) Number of publications
Refer to caption
(b) Publications’ difference
Refer to caption
(c) Citations
Refer to caption
(d) Citations’ difference
Refer to caption
(e) H-indices
Refer to caption
(f) H-indices’ difference
Figure 7: Top10Top_{10} advisees’ academic performance supervised by Top10Top_{10} advisors and ResRes advisors. (a), (c), and (e) show the Top10Top_{10} advisees’ number of publications, citations and h-indices coached by different advisors. Differences between Top10Top_{10} advisees’ NPNP, NCNC, and NHNH who are mentored by Top10Top_{10} advisors or ResRes advisors are highlighted in (b), (d), and (f).

The results of the correlational analysis among advisor-advisee’s collaboration duration, advisors’ academic ages, proportion of advisees’ h-index ranking the top 10% are shown in Fig. 8. The data in first three years of collaboration with advisors whose academic age is 5 are missing because of the slow growth rate in this period. It is difficult to calculate the top 10% advisees’ h-indices because the values of their h-indices are relatively low and similar. For ResRes advisors, the proportion of advisees ranking top 10% is shown in the left subgraph in Fig. 8(a), and the result of Top10Top_{10} advisors is summarized in the right subgraph. Fig. 8(b) shows the proportion of differences between advisees mentored by Top10Top_{10} advisors and ResRes advisors. The values of advisors whose academic ages are 30 in Fig. 8(b) are always below 0.1. For others, their values are all above 0.1, even the maximum value can reach 0.28. It indicates that most Top10Top_{10} advisees are coached by Top10Top_{10} advisors.

Refer to caption
(a) Proportion
Refer to caption
(b) Proportion differences
Figure 8: Correlations among advisor-advisee’s collaboration duration, advisors’ academic age, the proportion of advisees’ h-index ranking the top 10%. (a) Advisees’ probability of h-index ranking the top 10% mentored by ResRes and Top10Top_{10} advisors. (b) The proportion of differences between advisees mentored by Top10Top_{10} advisors and ResRes advisors.

Selective bias is a possible explanation for these phenomena. The advisor-advisee relationship is a two-way choice relationship, quality advisees tend to seek top advisors and vice versa. Moreover, we have also found that the gap between these advisees is closely related to the advisors’ academic age and the time duration of their collaboration. In the first three years of collaboration, the gap between advisees is relatively small. Since the fourth year, the gap begins to grow wider.

3.4 Publication rates analysis

As mentioned before, we have separated advisors according to their AAAA in the experiment. However, considering the publication rates increased over time, it may have a biased effect on the scholars’ number of publications, citations, and h-indices. So we carry out the experiment to verify whether the publications rates impact our findings.

Refer to caption
(a) New records per year
Refer to caption
(b) 1996-2002
Refer to caption
(c) 2003-2007
Figure 9: Advisees’ h-indices whose advisors’ AAAA is 20 in different time periods, as well as their h-indices are ranking at the top 5%, 5%-10%, 10%-15%, 15%-20%, 20%-25%. (b) Advisees’ average h-index mentored by advisors whose AAAA is 20 in 1996-2002. (c) Advisees’ average h-index mentored by advisors whose AAAA is 20 in 2003-2007.

Fig. 9(a) displays the number of new entries in the database per year. From the data in Fig. 9(a), it is apparent that the growth rate of the publication is different at different stages. The number of new records increases sharply in 2003 and 2011. To be more specific, there are less than 25,000 new publications per year in 1999-2002 but over 50,000 in 2003-2007, almost two times more than the first period. In order to ensure all of the advisees have plenty of time to accumulate enough publications, we choose advisors whose academic age is 20 in 1996-2002 and 2003-2007, respectively. We compare the average academic performance (h-index) of their advisees. In Fig. 9(b) and Fig. 9(c), the polylines show the advisees’ average h-index mentored by different h-index ranking advisors. Data in these figures can be compared with the data in Fig. 5, which is the basis of our findings. A positive correlation can be found between advisors’ and advisees’ academic performance. So the publication rates have limited influence on the phenomena we have found in this work.

4 Discussion

Moreover, we have processed our dataset and do experiments on the part of the real dataset to explore the correlation between the advisor and their academic grandchildren. We extract 4,256 advisor-advisee pairs (645 advisors) and compile the statistics on the data. For advisors, we still divide them into two groups according to their h-indices: Top10Top_{10} advisors and ResRes advisors. And then we analyze their grandchildren’s impact. We discover that the average h-index of Top10Top_{10} advisors’ grand-advisees is 40% higher than the grand-advisees mentored by Res advisors. It illustrates that there may be also a correlation existing between them and it needs in-depth research. Usually, PhD students are guided directly by their advisors, thus we only focus on the direct advisor-advisee relationships. Through this analysis, a comparatively deep analysis should also be made on the relationship between the advisor and their academic grandchild. We will carry out an in-depth study of this issue in future work.

There are still a few limitations in this work. Firstly, the conclusions are efficacious only for Computer Science, which is a rapidly developing discipline and has the characteristic of spreading its knowledge in conferences. It would be interesting to explore the anatomy of the relationship in other traditional areas of knowledge if we can get the large-scale advisor-advisee relationships dataset for the discipline. Secondly, we only show the correlation between advisors’ academic characteristics and advisees’ academic performance. Other factors may also produce these phenomena, such as selective bias and the rank of institutions. It is necessary to further explore the cause of these phenomena. Thirdly, it is unfortunate that the study only illustrates the advisors’ benefits derived from advisees on the formal relations. In terms of relevant experience, the informal advisor-advisee relationship is also an important part of mentorship. In future work, through the continuous improvement of the dataset, we will conduct the in-depth study on the informal advisor-advisee relationships.

5 Conclusion

This study presents advisees’ potential academic performance of choosing different advisors with different academic ages or academic levels in the field of Computer Science. It has identified the relationship between advisors’ academic age and advisees’ performance, which experiences an initial growth, follows a sustaining stage, and finally ends up with a declining trend. The second major finding is that accomplished advisors can bring up skilled advisees, which is evidenced by advisees’ academic performance advised by different advisors. Taken together, these findings suggest a significant role for advisors in promoting their advisees. This research extends our knowledge of scholars’ career success and will serve as a base for future studies.

In summary, through the analysis of the relationship between advisors and advisees, the findings have significant implications to understand the relationship between advisors’ academic characteristics and advisees’ performance. Moreover, it can shed light on the advisor recommendation. However, it should be noted that the results may differ in different research fields, it would be interesting to explore the anatomy of the relationships in other areas of knowledge.

Acknowledgment

The authors extend their appreciation to the Deanship of Scientific Research at King Saud University for funding this work through research group NO (RGP-1438-27).

References

  • (1)
  • Amjad et al. (2017) Amjad, T., Ding, Y., Xu, J., Zhang, C., Daud, A., Tang, J. and Song, M. (2017), ‘Standing on the shoulders of giants’, Journal of Informetrics 11(1), 307–323.
  • Anderson (2006) Anderson, C. (2006), The long tail: Why the future of business is selling less of more, Hachette Books.
  • Azoulay and Pierre (2012) Azoulay and Pierre (2012), ‘Research efficiency: Turn the scientific method on ourselves.’, Nature 484(7392), 31–2.
  • Borders et al. (2012) Borders, L. D., Wester, K. L., Granello, D. H., Chang, C. Y., Hays, D. G., Pepperell, J. and Spurgeon, S. L. (2012), ‘Association for counselor education and supervision guidelines for research mentorship: Development and implementation’, Counselor Education and Supervision 51(3), 162–175.
  • Bozionelos et al. (2014) Bozionelos, N., Bozionelos, G., Polychroniou, P. and Kostopoulos, K. (2014), ‘Mentoring receipt and personality: Evidence for non-linear relationships’, Journal of Business Research 67(2), 171–181.
  • Chao (1997) Chao, G. T. (1997), ‘Mentoring phases and outcomes’, Journal of Vocational Behavior 51(1), 15–28.
  • Chao et al. (1992) Chao, G. T., Walz, P. and Gardner, P. D. (1992), ‘Formal and informal mentorships: A comparison on mentoring functions and contrast with nonmentored counterparts’, Personnel Psychology 45(3), 619–636.
  • Dobson (2013) Dobson, L. (2013), ‘Effective practices of formal mentoring programs’, Math Alliance Research Study 3, 1–3.
  • Fagenson (1989) Fagenson, E. A. (1989), ‘The mentor advantage: Perceived career/job experiences of protégés versus non-protégés’, Journal of Organizational Behavior 10(4), 309–320.
  • Florea et al. (2013) Florea, L., Cheung, Y. H. and Herndon, N. C. (2013), ‘For all good reasons: Role of values in organizational sustainability’, Journal of Business Ethics 114(3), 393–408.
  • Ghosh and Reio (2013) Ghosh, R. and Reio, T. G. (2013), ‘Career benefits associated with mentoring for mentors: A meta-analysis’, Journal of Vocational Behavior 83(1), 106–116.
  • Glänzel et al. (2017) Glänzel, W., Heeffer, S. and Thijs, B. (2017), ‘Lexical analysis of scientific publications for nano-level scientometrics’, Scientometrics pp. 1–10.
  • Gollapalli et al. (2011) Gollapalli, S. D., Mitra, P. and Giles, C. L. (2011), Ranking authors in digital libraries, in ‘Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries’, ACM, pp. 251–254.
  • Hu et al. (2014) Hu, C., Wang, S., Yang, C.-C. and Wu, T.-y. (2014), ‘When mentors feel supported: Relationships with mentoring functions and protégés’ perceived organizational support’, Journal of Organizational Behavior 35(1), 22–37.
  • Johnson and Ridley (2015) Johnson, W. B. and Ridley, C. R. (2015), The elements of mentoring, Macmillan.
  • Kram (1983) Kram, K. E. (1983), ‘Phases of the mentor relationship’, Academy of Management Journal 26(4), 608–625.
  • Langfeldt et al. (2015) Langfeldt, L., Benner, M., Sivertsen, G., Kristiansen, E. H., Aksnes, D. W., Borlaug, S. B., Hansen, H. F., Kallerud, E. and Pelkonen, A. (2015), ‘Excellence and growth dynamics: A comparative study of the matthew effect’, Science and Public Policy p. scu083.
  • Lehmann et al. (2006) Lehmann, S., Jackson, A. D. and Lautrup, B. E. (2006), ‘Measures for measures’, Nature 444(7122), 1003–1004.
  • Letchford et al. (2015) Letchford, A., Moat, H. S. and Preis, T. (2015), ‘The advantage of short paper titles’, Royal Society open science 2(8), 150266.
  • Ley (2009) Ley, M. (2009), ‘DBLP: Some lessons learned’, Proceedings of the VLDB Endowment 2(2), 1493–1500.
  • Malmgren et al. (2010) Malmgren, R. D., Ottino, J. M. and Amaral, L. A. N. (2010), ‘The role of mentorship in protégé performance’, Nature 465(7298), 622–626.
  • Moreira et al. (2011) Moreira, C., Calado, P. and Martins, B. (2011), Learning to rank for expert search in digital libraries of academic publications, in ‘Portuguese Conference on Artificial Intelligence’, Springer, pp. 431–445.
  • Murphy (2015) Murphy, T. H. (2015), ‘On great teachers’, Journal of Legal Studies Education 32(1), 223–227.
  • Newman (2001) Newman, M. E. (2001), ‘The structure of scientific collaboration networks’, Proceedings of the National Academy of Sciences 98(2), 404–409.
  • Owens (2013) Owens, B. (2013), ‘Research assessments: Judgement day.’, Nature 502(7471), 288–90.
  • Scandura and Ragins (1993) Scandura, T. A. and Ragins, B. R. (1993), ‘The effects of sex and gender role orientation on mentorship in male-dominated occupations’, Journal of Vocational Behavior 43(3), 251–265.
  • Sinatra et al. (2016) Sinatra, R., Wang, D., Deville, P., Song, C. and Barabási, A.-L. (2016), ‘Quantifying the evolution of individual scientific impact’, Science 354(6312), aaf5239.
  • Singh et al. (2009) Singh, R., Ragins, B. R. and Tharenou, P. (2009), ‘Who gets a mentor? A longitudinal assessment of the rising star hypothesis’, Journal of Vocational Behavior 74(1), 11–17.
  • Tang et al. (2008) Tang, J., Jin, R. and Zhang, J. (2008), A topic modeling approach and its integration into the random walk framework for academic search, in ‘Data Mining, 2008. ICDM’08. Eighth IEEE International Conference on’, IEEE, pp. 1055–1060.
  • Tang et al. (2008) Tang, J., Zhang, J., Yao, L., Li, J., Zhang, L. and Su, Z. (2008), Arnetminer: Extraction and mining of academic social networks, in ‘Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining’, ACM, pp. 990–998.
  • Tuesta et al. (2015) Tuesta, E. F., Delgado, K. V., Mugnaini, R., Digiampietri, L. A., Mena-Chalco, J. P. and Pérez-Alcázar, J. J. (2015), ‘Analysis of an advisor-advisee relationship: An exploratory study of the area of exact and earth sciences in brazil’, PloS one 10(5), e0129065.
  • Wanberg et al. (2006) Wanberg, C. R., Kammeyer-Mueller, J. and Marchese, M. (2006), ‘Mentor and protégé predictors and outcomes of mentoring in a formal mentoring program’, Journal of Vocational Behavior 69(3), 410–423.
  • Wang et al. (2017) Wang, W., Liu, J., Xia, F., King, I. and Tong, H. (2017), Shifu: Deep learning based advisor-advisee relationship mining in scholarly big data, in ‘Proceedings of the 26th International Conference on World Wide Web Companion’, International World Wide Web Conferences Steering Committee, pp. 303–310.
  • Wang et al. (2016) Wang, W., Liu, J., Yu, S., Zhang, C., Xu, Z. and Xia, F. (2016), Mining advisor-advisee relationships in scholarly big data: A deep learning approach, in ‘Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries’, ACM, pp. 209–210.
  • Wang et al. (2017) Wang, W., Yu, S., Bekele, T. M., Kong, X. and Xia, F. (2017), ‘Scientific collaboration patterns vary with scholars academic ages’, Scientometrics 112(1), 329–343.
  • Xia et al. (2017) Xia, F., Wang, W., Bekele, T. M. and Liu, H. (2017), ‘Big scholarly data: A survey’, IEEE Transactions on Big Data 3(1), 18–35.
  • Young and Perrewe (2000) Young, A. M. and Perrewe, P. L. (2000), ‘What did you expect? an examination of career-related support and social support among mentors and protégés’, Journal of Management 26(4), 611–632.