(sorry for the long cut and paste, but Science magazine is subscription only)

Network Theory--the Emergence of the Creative Enterprise
by Albert-László Barabási

www.sciencemag.org/cgi/cont.../5722/639

In the Foundation Trilogy, Isaac Asimov placed psychohistorian Harry Seldon so far into the future that Earth, the birthplace of the Galactic civilization, has been forgotten (1). Indeed, Star Trek's teleporting characters appear far more grounded in reality than Seldon's mathematical equations that accurately predict the multigalactic society's fate thousands of years into the future. Today, when reports about quantum teleportation fill the pages of the best physics journals, we wonder how long it will be until a real Harry Seldon produces an accurate mathematical theory of human behavior.

It may be hard to believe, but conditions for such a quantitative approach are increasingly in place. Indeed, records of human actions are already stored in numerous databases. E-mail and phone records document our social and professional interactions; travel records and GPS navigation systems capture our travel patterns and physical locations; credit-card companies maintain records of our shopping and entertainment habits. Although in the wrong hands, these data sets represent Orwellian tools of power, for scientists they offer incredible insights into human behavior. Combine this capability with the sophisticated tool of network theory (2-7), which analyzes relations between millions of individuals, and you get a glimpse of an unprecedented opportunity to quantify human dynamics. Although a mathematical theory of social complexity remains a pipe dream, it is not as farfetched as it may have appeared in 1942, when Foundation was first published. Proof of this can be found in the study by Guimerà et al. on page 697 of this issue (8). By taking advantage of publicly available data sets from both artistic and scientific fields, the authors offer powerful insights into the mechanisms governing collective human behavior.

Traditionally, the achievements of individuals such as Darwin and Einstein have dominated the public's image of science, yet today some of the most groundbreaking work is collaborative in nature (see the figure). But how do such creative teams come about? Are there discernible differences between collaborations that are sparklingly creative and those that are less inventive? Guimerà et al. use network theory to answer these questions. Their starting point is a collection of fascinating data sets: a century-long record of Broadway musicals and the publication records of several fields of science. These data sets allowed them to reconstruct the collaborative history of the individuals who contributed to a particular show or research publication. The investigators document a changing creative enterprise in which advances require an increasing number of contributors. The history of Broadway is particularly illuminating: The team size responsible for producing a show increased until the 1930s, after which it leveled off, fluctuating at around seven contributors for the past 70 years. In contrast, science continues to search for its optimal collaborative setup: The number of coauthors in each scientific field has increased monotonically during the past decade. It is anyone's guess when and where it will reach a maximum.

Figure 1 Evolution of the scientific enterprise. (Left) For centuries, creative individuals were embedded in an invisible college, that is, a community of scholars whose exchange of ideas represented the basis for scientific advances. Although intellectuals built on each other's work and communicated with each other, they published alone. Most great ideas were attributed to a few influential thinkers: Galileo, Newton, Darwin, and Einstein. Thus, the traditional scientific enterprise is best described by many isolated nodes (blue circles). (Middle) In the 20th century, science became an increasingly collaborative enterprise, resulting in such iconic pairs as the physicist Crick and the biologist Watson (left), who were responsible for unraveling DNA's structure. The joint publications documenting these collaborations shed light on the invisible college, replacing the hidden links with published coauthorships. (Right) Although it is unlikely that large collaborations--such as the D0 team in particle physics or the International Human Genome Sequencing Consortium pictured here--will come to dominate science, most fields need such collaborations. Indeed, the size of collaborative teams is increasing, turning the scientific enterprise into a densely interconnected network whose evolution is driven by simple universal laws.

PHOTO CREDITS: (LEFT) NEWTON AND GALILEO/AIP EMILIO SEGRE VISUAL ARCHIVES; DARWIN/SOTHDBY'S; (MIDDLE) A. BARRINGTON BROWN/PHOTO RESEARCHERS; (RIGHT) LARRY THOMPSON/NIH

Until the late 1990s, the bulk of network research focused on static properties, which do not change with time (9). Yet a proper understanding of most networks requires that we characterize the assembly process that generated them. Indeed, a map of such networks is not sufficient to understand the structure of the World Wide Web--we must describe how documents and links are added and removed (3). Uncovering all interactions between proteins is only the first step toward understanding cellular networks--we must also explore the importance of gene duplications and mutations that shape the interactions between proteins and genes (10). Similarly, to comprehend the structure of the collaboration map, we must understand how people form friendships and alliances. Given that in the professional world friendships are just as crucial as hard-nosed professional interests, modeling the evolution of creative teams may appear to be impossible. Guimerà's results indicate otherwise: They show that a simple model successfully captures many qualitative features of the network underlying the creative enterprise. In their study, they distinguish between veterans, who have participated in collaborations before, and rookies, who are about to see their names appear in print for the first time. Two parameters are key: the fraction of veteran members in a new team, and the degree to which veterans involve their former collaborators. If choosing experienced veterans is not a priority, the authors find that the network will be broken up into many small teams with little overlap between them. As the likelihood of relying on veterans increases, thanks to the extra links to earlier collaborators, the teams coalesce through a phase transition such that all players become part of a single cluster.

Many professional networks--from the web of actors in Hollywood to scientific collaborations (11, 12)--are scale-free (13), that is, although most individuals have only a few collaborators, a few have hundreds and operate as hubs. The legendary Paul Erdös, the father of random network theory, with more than 500 collaborators, was probably the best known hub within mathematics. The model that Guimerà and co-workers propose does indeed account for hubs, the emergence of which is rooted in the rookies' desire to involve their friends in new teams. Indeed, the more collaborators an individual has, the higher the chances are that he or she will be invited to participate again. This process--called preferential attachment in network theory--is responsible for the emergence of hubs through a rich-gets-richer process (13) in which well-connected individuals continue to be in high demand.

How does this assembly process affect the team's performance? The results of the Guimerà et al. study indicate that expertise does matter: Teams publishing in high-impact journals have a high fraction of incumbents. But diversity matters too: Teams with many former collaborative links offer inferior performance. Thus, the recipe for success seems relatively simple: When forming a "dream team" make an effort to include the most experienced people, whether or not you have worked with them before. The temptation to work mainly with friends will eventually hurt performance.

In Asimov's classic story, Harry Seldon's theory could not handle innovation. To stay on the predictive side, the Foundation went to great lengths to freeze all technological development. Indeed, the most disruptive social changes humanity has experienced are intimately tied to new technologies, from the steam engine to the Internet. It is tempting to conclude, therefore, that given the unpredictability of potential technologies, a theory of human dynamics will have no chance of success until scientific innovation ceases. A more constructive approach, and one taken by Guimerà at al., takes us in the opposite direction, bringing innovation into a scientific and mathematical perspective.

Finally, will there ever be a Harry Seldon and a mathematical theory of human behavior? It is easy to maintain that human actions are too complex to be predictable. But skeptics are proven wrong each time a waiter brings them ketchup with their fries, without having been asked to. A master of consumer behavior, the waiter concludes that very likely they will ask for it. In the same way, a data-driven understanding of human actions could help us to translate into a predictive mathematical language the fundamental principles that drive a society's collective behavior. In a world in which all events are recorded by computers, the conditions for this research are increasingly in place. The quantitative accumulation of such data could easily spark a qualitative change, helping many disparate facts to fall into a coherent predictive theory. By demonstrating that the Web, the cell, or society is driven by similar organizing principles, network theory offers a successful conceptual framework to approach the structure of many complex systems. Perhaps a predictive framework that captures the dynamics and the behavior of networks is not too far behind either. In the light of Guimerà et al.'s results, we can be sure of one thing: While pursuing a theory of human behavior, we could overlook a Harry Seldon. A mathematical theory of human dynamics may not be the solitary achievement of a genius scientist (14), but will likely emerge from the combined efforts of an expert team with just the right combination of expertise, collaborative experience, and fresh ideas.

References

1. I. Asimov, Foundation and Empire (Spectra, New York, 1991) [publisher's information].
2. S. N. Dorogovtsev, J. F. F. Mendes, Evolution of Networks: From Biological Nets to the Internet and WWW (Oxford Univ. Press, New York, 2003) [publisher's information].
3. A. Pastor-Satorras, A. Vespignani, Evolution and Structure of the Internet: A Statistical Physics Approach (Cambridge Univ. Press, Cambridge,UK, 2004) [publisher's information].
4. R. Albert, A.-L. Barabási, Rev. Mod. Phys. 74, 47 (2002) [APS].
5. S. H. Strogatz, Nature 410, 268 (2001) [Medline].
6. E. Ben-Naim, H. Frauenfelder, Z. Toroczkai, Complex Networks (Springer Verlag, Berlin, 2004) [publisher's information].
7. S. Bornholdt, H. G. Schuster, Eds., Handbook of Graphs and Networks: From the Genome to the Internet (Wiley-VCH, Weinheim, Germany, 2003) [publisher's information].
8. R. Guimerà, B. Uzzi, J. Spiro, L. A. Nunes Amaral, Science 308, 697 (2005).
9. B. Bollobás, Random Graphs (Cambridge Univ. Press, Cambridge, UK, 2001) [publisher's information].
10. A.-L. Barabási, Z. N. Oltvai, Nat. Rev. Genet. 5, 101 (2004) [Medline].
11. M. E. J. Newman, Phys. Rev. E 64, 016131 (2001) [APS].
12. A.-L. Barabási et al., Physica A 311, 590 (2002) [Abstract].
13. A.-L. Barabási, R. Albert, Science 286, [509] (1999).
14. R. Collins, The Sociology of Philosophies (Harvard Univ. Press, Cambridge, MA, 1998) [publisher's information].
posted by:
Daniel
Seattle