709
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Collaborative Coding Cultures: How Journalists Use GitHub as a Trading Zone

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon

Abstract

As the use of “big data” in journalism has become an increasing practice in recent years, its exercise is no longer limited to newsrooms in Europe and North America. The prevalence of coding cultures and collaboration in newsrooms worldwide has risen, in part to tackle challenges related to information overflow. This collaboration has been facilitated partly by the popularization of low-cost code hosting platforms such as GitHub. GitHub is a well-known public code management system with a significant potential for transparency and collaboration between communities. This study aims to map how newsrooms employ GitHub as a trading zone for collaborative journalism. We analyze the GitHub activities of leading media outlets from 27 countries. Based on content analysis of the newsroom’s repositories and users’ profiles, we found that media organizations in the United Kingdom and the United States continue to be the primary code producers in GitHub. However, our results show that newsrooms in developing countries benefit from these codes by stargazing these projects. The results of this article have important implications for understanding how newsrooms across the globe use GitHub for collaboration and the role journalistic code plays in bringing together various collaborative initiatives and actors in journalism.

Introduction

Collaboration across news organizations is increasingly assumed to be necessary and desirable during the practice of data-driven journalism (Alfter Citation2019; Sambrook et al. Citation2018). Indeed, in an age of “big data” where information is abundant, the sheer volume of available data sets that are relevant for reporting but in need of high-skilled analysis has prodded media workers to participate in collaborative projects to tackle more complex reporting (Lewis and Nashmi Citation2019; Loosen, Reimer, and De Silva-Schmidt Citation2020). Some of these projects have gained significant international attention during the last decade, including reporting on the leak of over 11 million documents in the Panama and Paradise Papers case or the fact-checking initiatives during the COVID-19 global health pandemic (for an overview see e.g., Alfter and Cândea Citation2019; Carson and Farhall Citation2018).

The inter-organizational collaborative phenomenon in reporting, that is, between two media organizations, also points to an emergent form of collaborative journalism. Cooperation in journalism is in no way new (Dodds Citation2019). However, this new type of collaborative journalism is characterized by at least two novel features. Firstly, we are observing new methods—from network analysis to graph-based keyword search systems—for the reporting, coordination, and collaboration between media organizations under the frame of a new “global network of journalism” (Berglez and Gearing Citation2018) that has the potential to break down professional silos in journalism (Heft Citation2021). Secondly, this new form of reporting further embodies the diversification of professional roles inside newsrooms. To avoid information asymmetry—newsrooms’ inability to gather, analyze, or disseminate information due, not to the lack of it, but to the affluence of data points (Gray, Chambers, and Bounegru Citation2012)—new computer programmers, graphic designers, entrepreneurs, and data scientists have been included in the decision-making process of news production like never before (Brossi and Dodds Citation2019; Lewis and Usher Citation2014).

These characteristics in collaborative journalism allow contemporary multimodal newsrooms to share raw data and information, as well as the elevated costs associated with high-skilled analysis. As Carson and Farhall argue after examining award-winning cases in Britain, the United States, and Australia, “investigative collaborations and digital approaches are powerful antidotes to declining revenues and falling journalist numbers as we move deeper into the twenty-first century” (2018, 1909). In times of already dwindling trust levels towards the press and click-based business models, collaborative journalism can improve small and disadvantaged newsrooms’ practices and motivations, as well as their access to investigative projects and public-interest data (Dodds Citation2021).

Furthermore, collaborative journalism is also gaining traction for yet another reason. Studies in the United States have shown that this type of reporting could boost transparency and accountability by showing the methods journalists use to construct their stories (Curry and Stroud Citation2021; Lewis and Usher Citation2013). Journalistic programming code helps the audience understand the process behind the stories they are reading. It also works as a script for other newsrooms that would like to analyze similar data in local contexts.

Nevertheless, despite the increasing importance of collaboration in journalism, “very few have talked about data journalism in non-Western contexts—at least not in an academic setting, leaving many wondering whether it exists” (Mutsvairo Citation2019). Indeed, despite the widespread use of data in journalism worldwide (Wachs et al. Citation2022), research has traditionally only shone a light on particular countries in North America and Western Europe (Appelgren, Lindén, and van Dalen Citation2019). By omitting media systems outside of Western Europe and North America, academic research has created a geographical gap in the study of data-driven journalism and focused chiefly on liberal and democratic corporatist media systems. This Western-centric approach to the study of innovation in journalism has failed to include news organizations that deal with “big data” under strictly regulated media environments (Wu Citation2022) or are run by grassroots and marginalized communities (Dodds Citation2021). Moreover, media organizations in Western, educated, industrialized, rich, and democratic countries have more resources because of their more affluent memberships and higher income through advertising. As the costs of practicing data journalism increase due to the need to hire high-skilled analysts, data journalism will likely be implemented differently in developing countries. To understand how code mediates newswork, we need to observe how different actors are appropriating codes, scripts, and programming languages to their realities and sharing their experiences with other news organizations.

This paper aims to overcome these limitations and reveal the diversity, similarities, and differences between collaborative coding cultures in journalism across the globe. Collaborative coding cultures result from the popularization of open-source code platforms, and journalists need to analyze and report the abundance of public interest data. Depending on the specific conditions of any given country, such as their levels of data access, privacy laws, or information rights, new relational and dynamic values have emerged in newsrooms across African, Middle Eastern, Asian, and Latin American countries (Bebawi Citation2016; Broussard and Boss Citation2018; Wright, Zamith, and Bebawi Citation2019). GitHub, one of the most popular platforms for sharing and collaboratively developing open-source code, has emerged as one of media organizations’ most popular trading zones. Its popularity has increased so much during the last years that, according to Davis, “GitHub has […] become an active social network in and of itself, where people with shared interests can follow each other and see the progress on various projects” (2015, 162). As an example of a collaborative coding culture, GitHub enables developers, journalists, and researchers with different professional backgrounds to track programming changes and collaborate on source-code management within its platform.

Founded in 2008 and now with over 80 million users (Gonzalez, Zimmermann, and Nagappan Citation2020), GitHub provides access to remote repositories for Git—a version control system that allows users to keep track of the changes made in their code—a feature that results particularly beneficial in team-based work developments.

This paper further aims to map how newsrooms across a range of countries employ GitHub as a trading zone for collaborative journalism, if they do it at all. The lack of studies about collaborative coding for data journalism in non-Western contexts gives this article an exploratory nature. To guide this research, we discuss the following main question: How are news organizations using the code-sharing platform GitHub? To answer this main question, we developed the following sub-research questions: (1) To which media organizations belong the most prominent journalistic repositories on GitHub? (2) What type of journalistic projects and themes are most common on GitHub? (3) Who are the primary actors interacting with journalistic repositories on GitHub?

Collaboration Cultures and Trading Zones

GitHub is one of the key platforms that afford digital trading zones for journalism. As more data becomes available for newsrooms to gather, analyze, and present to the audience, there has been an increasing demand for collaboration between news organizations (i.e., collaborative cultures) (Hellmueller, Cheema, and Zhang Citation2017). To facilitate collaboration between newsrooms, media organizations are now taking steps to train and hire journalists with the necessary technical skills or bring in data scientists to work alongside journalists (Guo and Volz Citation2021). These data scientists are expected to contribute to journalistic collaboration by providing their expertise in advanced numerical and quantitative abilities, big data analysis, and website development (Boyles Citation2020).

Thus, collaborating cultures, including coding, have contributed to the emergence of trading zones in journalism. Trading zones are spaces “within which individuals coming from different traditions or with distinct expertise can gather, agree on rules of exchange, and engage in complex, coordinated activity around shared goals” (Haim and Zamith Citation2019, 80–81). For Lewis and Usher (Citation2016), trading zone is a valuable concept for understanding the inclusion of heterogeneous actors in journalism, their social interaction with traditional reporters, and the impact they end up having on newswork. Zhang and Ho (Citation2022) further argue that trading zones in journalism are a sound theoretical framework to capture cross-disciplinary collaboration in contemporary multimodal newsrooms, both offline and online.

As a sign of the evolving technological landscape within newsrooms, trading zones can lead to both fruitful and challenging interactions inside and between news organizations. Data scientists typically do not undergo formal training or education specific to journalism. Their educational background typically revolves around data science, statistics, computer science, or related fields, which could present challenges for their development inside news organizations (Fairfield and Shtein Citation2014). However, this also opens the door for non-data personnel to contribute to data science teams by generating story ideas, evaluating stories’ viability, and ensuring data’s accuracy. At the same time, data scientists can play a crucial role in breathing life into stories by accessing datasets through automated methods and analyzing intricate data points (Zamith Citation2023).

However, trading zones in themselves do not always guarantee increased collaboration between news workers or news organizations. For example, using computational tools like R and JavaScript may be understandable primarily to journalists with technical inclinations, which could inadvertently exclude individuals or organizations lacking proficiency in programming (Stalph, Hahn, and Liewehr Citation2022).

Drawing from Galison’s (Citation1997) first theorization on the concept, in this article, trading zones are understood as virtual and physical spaces where journalists interact, cooperate, and develop new tools with a heterogeneous group of actors with similar interests. The popularization of trading zones also fulfills three primary needs in contemporary journalism: (1) they promote the necessary diversification of professional roles, (2) they bridge the gap in educational backgrounds, thus relieving stress on media workers, and (3) they help overcome workforce limitations and economic conditions shortcomings. In this section, we will review how trading zones assist in fulfilling each of these needs.

Collaboration between news organizations has expanded beyond national borders. Cross-border collaborative journalism has gained popularity primarily due to data journalism initiatives seeking to generate in-depth reporting on complex global topics. As Heft argues, “although journalism has long been considered a profession of lone wolves, its present and future are shaped by collaborative practices of various kinds” (2019, 191). Indeed, cooperation is the norm in processing newly available data (Larrondo-Ureta and Ferreras-Rodríguez Citation2021). Along with these new forms of local and global collaboration, media organizations are facing the need to professionally diversify their newsrooms and hire creative artists, data scientists, and developers who can analyze and reshape journalistic content to the latest platforms media organizations need to use to distribute their news (Royal and Kiesow Citation2021). The integration of traditional journalism with professionals from other fields has not been exempted from difficulties (Weber and Rall Citation2012). Yet, Zhang and Chen (Citation2022) show how, under the trading zones framework, the benefits of interdisciplinary collaboration are positively perceived by media workers.

Moreover, trading zones help journalists overcome the shortcomings of their educational backgrounds, particularly in contexts in which limited education in data handling and mining is available. That is, in a context of rapid technological transformations, in which journalists need to feed a different number of platforms and produce multimodal texts and designs, the limitations in the educational background of media professionals have been bridged by incorporating novel professional roles in journalism (Russell Citation2016). Not doing so could have severe consequences for legacy media and traditional journalists. As Bossio and Nelson argue, “increasing reliance on new tools has been a source of mental and physical burnout for journalists working within new online and digital work environments and competing with new media actors” (2021, 1377). The need to keep up to date with new tools for creating news stories, data analysis, and their distribution has often translated into deteriorating mental health for journalists.

Collaboration in trading zones might be necessary to overcome workforce limitations and economic conditions shortcomings in contemporary newsrooms. As Madrigal and Meyer find out, in their quest to increase their audience metrics and push toward the convergence of their platforms, media organizations have fired hundreds of journalists (2018), favoring those professionals with the data skills and knowledge required to engage with new technologies. Trading zones allow experts from different domains to make their data available and the scripts for how to work with and analyze it. As such, smaller, low-budget newsrooms do not have to dedicate unobtainable resources to exploring a database. Instead, they can benefit from the work arranged by other actors or groups, increasing their production possibilities.

As data-driven journalism shifts further away from traditional cultures within legacy media, new platforms like GitHub and Slack are emerging as alternatives for journalists, hackers, and citizens to engage in the practice of collaborative journalism (Ausserhofer et al. Citation2020; Bunce, Wright, and Scott Citation2018).

Finally, successfully using these platforms to practice cross-border collaborative journalism could have another advantage. Creating virtual spaces where journalists and a diverse range of professionals from different countries and regions interact can aid in eradicating nationalistic viewpoints and fostering a better grasp of intricate global concerns (Sambrook et al. Citation2018). Yet, as Kunert et al. (Citation2022) argue, investigative journalists worldwide are adopting innovative digital practices at different speeds, with journalists in North America and Europe adopting technologies faster than those in developing countries. Cueva Chacón and Saldaña also show that, for the Latin American case, “the practice of national and transnational collaboration in the region is still scarce” (2021, 201). Whether this gap between North America and Europe and the rest of developing countries could signal the reproduction of media asymmetries and cultural dependencies between news organizations on a global scale during the practice of cross-border collaborative coding journalism is a question that remains unanswered. Hence, in this article, we deliberately refrain from presuming a priori that inter-organizational collaboration (especially a balanced one) is the prevailing norm, particularly on a platform like GitHub. Although the literature shows that newsrooms in developing countries benefit from international collaborations in general, we do not assume that they collaborate with the repositories on GitHub equally as countries in North America and Europe. Instead, this is precisely what our study is designed to investigate.

The Social Coding in GitHub

Structurally, GitHub comprises public and private repositories. Repositories are “folders” or spaces where users can save their project files and store and track versions of their codes. This affordance allows for a greater level of data structure, which simplifies the management of information. Moreover, GitHub also supports various file types for users to include in their conversations, such as JPEG, PDFs, GIF, PNG, ZIP, and other text files and videos. The diversity in the supported files allows GitHub to be more than a “code dump” and attract hobbyists and dedicated developers alike (Dabbish et al. Citation2012).

Furthermore, one of GitHub’s main strengths is “the awareness and transparency features it provides to team, project and community members” (Zagalsky et al. Citation2015, 1906), which positively affect people’s likelihood to contribute to shared projects (Tsay, Dabbish, and Herbsleb Citation2012). Transparency plays an essential role in GitHub community building and engagement, for it allows users to learn from the experiences of others. As Dabbish et al. put it, “being able to watch how someone else coded, what others paid attention to, and how they solved problems all supported learning better ways to code and access superior knowledge” (2012, 1283). Dabbish et al. (Citation2012) also point out how users within the platform follow specific actions, signaling both the developer’s status and the quality of their projects. As they put it, “the number of followers a developer had was interpreted as a signal of status in the community. Developers with lots of followers were treated as local celebrities” (2012, 1281).

Undoubtedly, open-source projects permit fellow developers, scientists, and journalists to track software improvement closely. It is also a quality of the platform that aligns with the new ethos behind collaborative journalism. As Porlezza and Splendore (Citation2016) argue, transparency and accountability in journalism, both qualities that GitHub offers, can be seen as a form of incentivizing participation for audiences, investors, or fellow reporters, an indispensable quality for emerging media outlets.

Using GitHub for collaboration also expands the possible actors involved in news production. For example, Mergel (Citation2015) shows how GitHub has allowed fruitful cooperation between public and private communities for years. The GitHub online community can reuse, contribute, and improve the work made available by government coders, and in turn, government workers contribute to other organizations’ software development.

In a different context, Flores-Saviaga and Savage (Citation2021) show how, in the aftermath of the earthquake that hit Mexico on the 19th of September, 2017 (known as the #19S earthquake), different users across social media came together to assist media outlets, NGOs, and universities in their effort to verify information related to the disaster using collaborative tools like GitHub and Slack. Flores-Saviaga and Savage found that GitHub allowed some volunteers to “create digital tools that seemed to bridge everyday users and technical ones” (2021). Collaborating with more experienced developers or allowing them to follow their actions and problem-solving strategies shows how this platform could be a tempting option for low-budget newsrooms worldwide.

In recent decades, GitHub has become increasingly popular as an open-source platform that functions as a trading zone for journalists, entrepreneurs, graphic designers, developers, and other actors. GitHub has been critical in popularizing collaborative coding cultures for sharing and analyzing information. This study examines how different stakeholders in journalism collaborate through GitHub by contributing and bookmarking scripts. Through a content analysis, we identify (1) the different project types developed, (2) the distinct actors across the globe that contribute to an array of repositories, (3) and the media organizations that are producing the projects and repositories. These questions stem directly from the recent literature on trading zones and GitHub and allow us to observe whether GitHub is indeed functioning as a trading zone for journalism or whether the scarcity of national and cross-national collaboration described by Cueva Chacón and Saldaña (Citation2021) is replicated on GitHub. Notably, as Zhang and Ho (Citation2022) argue, one of the critical characteristics of trading zones is their cross-disciplinary collaboration. Thus, it is indispensable to see the type of actors involved in this coding culture. Furthermore, this is also an opportunity to see if GitHub is bridging the gap between newsrooms across a range of countries, thus allowing the development of projects that benefit collaborative journalism, which seems to be the ethos of modern investigative journalistic projects (Larrondo-Ureta and Ferreras-Rodríguez Citation2021). Lastly, as Ausserhofer et al. (Citation2020) argues, it is essential to test if the most popular journalistic repositories belong to legacy media outlets or if it has effectively opened the door for alternative organizations and actors.

Methodology

We obtained 8,686 repositories of 140 media organizations by querying GitHub’s application programming interface (API) from 2010 until 2022 (∼12 years). Due to the restrictions imposed by the GitHub API on the number of requests, it was necessary to establish a maximum limit of 100 projects per organization on each call. Nevertheless, all the projects from the media companies were thoroughly extracted. To retrieve data from GitHub’s API, username accounts are needed. We gathered the usernames by manually inspecting a list of 950 organizations in the first 50 countries outlined in the World Press Freedom Index. In , we present an overview of the global distribution of media organizations with and without GitHub accounts. Our findings reveal 170 unique GitHub accounts from the 950 media organizations inspected, with a majority being based in the United States. It is also important to note that among the media companies, two had multiple GitHub accounts. For instance, The New York Times operated both “nytimes” and “newsdev” accounts, while Politico managed “politico” and “The-Politico” accounts.

Figure 1. Overview of media organizations with GitHub accounts across different countries.

Figure 1. Overview of media organizations with GitHub accounts across different countries.

From the subset, we extracted metadata—including elements such as descriptions, coding languages, stargazer counts, and followers—to identify the most popular ones and their projects. The complete list of media organizations inspected and the different GitHub accounts can be examined in the following link: t.ly/c4P7.

With this approach, we gathered 8,900 repositories. However, those lacking information were excluded. Thus, in our final sample, we only analyzed 8,686 repositories from 140 different media companies, including those with double GitHub accounts (i.e., The New York Times and Politico). On average, each media company maintained approximately 61 repositories. The repositories included information such as the project description, the number of stargazers, the number of forks, the programming language, and the users that contributed to the project.

Operationalization of Interactions

Before proceeding with our analysis, we must define four essential concepts for this study: stargazers, watchers, forks, and contributors. Stargazers refer to users who have bookmarked a repository, indicating engagement and potential future interest. The number of stargazers also reflects a repository’s popularity or perceived value in the GitHub community.

Watchers, on the other hand, are users who subscribe to updates for a particular repository and receive notifications about various activities, such as new issues, pull requests, or general updates. Unlike stargazing, watching a repository indicates a higher level of interest and active engagement with the project. We examine the number of watchers to measure community involvement and sustained attention toward a repository.

A “fork” is when a user creates a personal copy of a repository on GitHub. This process duplicates the entire repository, including all files, commit history, and branches, into the user’s GitHub account.

Lastly, contributions refer to a range of actions users take to participate in a repository. We consider contributions as interactions that involve active engagement with the repository, such as making commits, creating branches, opening pull requests, submitting issues, and providing comments on existing code or discussions. By evaluating contributions, we aim to understand the level of involvement and collaboration within a repository.

Users

We sample all user accounts contributing to each repository, aiming to understand the location of the interactions. We extracted 15,726 unique users, including their biographies. Drawing from users’ biographies, we coded 14,080 profiles into six groups: journalists, developers, scientists, learners, entrepreneurs, and graphic designers. These categories are selected since prior research describes them as common professional roles in contemporary newsrooms due to increased data management practices (Engebretsen, Kennedy, and Weber Citation2018). Users were only categorized once using string detection (i.e., exclusively categorized) based on the presence and order of keywords in their biographies (Appendix 1, available in OSF). The categorization of users was tested through a logistic regression model, obtaining .98 accuracy.

Furthermore, we identified contributors’ countries of origin from those with a location in their profile. In total, we identified 122 different locations (). In addition, shows an overview of the media companies in our sample globally.

Figure 2. Distribution of media companies in our sample across the globe.

Figure 2. Distribution of media companies in our sample across the globe.

Table 1. Descriptives overview.

Additionally, we extracted and analyzed the description of the 8,686 projects. Following prior studies, we used an inductive-deductive analysis to identify themes in the repositories (Boumans and Trilling Citation2016; Grootendorst Citation2022; Wiedemann and Fedtke Citation2022). The first description analysis was done through unsupervised topic modeling using a pre-trained topic model (BERTopic) (Grootendorst Citation2022). We decided to use BERTopic since it considers the semantic relationship among words compared to other models, such as LDA (Grootendorst Citation2022). Before implementing the model, we pre-processed the descriptions in preparation for analysis. This pre-processing step involved removing URLs, symbols, and stopwords. Then, we tokenized the works and processed the text using the “all-MiniLML6-v2” MiniLM language model for BERTopic. This language model has been chosen for its exceptional performance in ensuring topic coherence for short texts. This approach allowed us to obtain an initial list of 155 topics acknowledging the context of the words. The performance of BERTopic was reflected through a topic coherence of .75.

Later, we ran several simulations estimating dimensionality and the number of topics to assess the clustering for the best topic coherence performance. The results of the simulations pointed out that 25 topics would provide the best topic coherence scoring .44. The identification of the topics was performed manually by analyzing the loadings in the topic modeling, the prevalent words that were exclusive from other topics, and the documents that were representative of each topic (Wiedemann and Fedtke Citation2022). We used the topics assessed by BERTopic to build a codebook. The details of the codebook can be found in the following OSF link: t.ly/c4P7.

Results

The way media organizations worldwide use GitHub as a platform for collaboration displays a great diversity of actors and a variety of topics. In this section, we described the stakeholders, how users interact with the repositories, and the themes they engage with. To understand how news organizations in different countries use GitHub as a code-sharing platform, we present the overall results in , , and .

Figure 3. Top ten media companies with highest stargazed projects.

Figure 3. Top ten media companies with highest stargazed projects.

Figure 4. Frequency of projects by topic.

In , the frequency is calculated through the count of categorized repositories under a specific topic.

Figure 4. Frequency of projects by topic.In Figure 4, the frequency is calculated through the count of categorized repositories under a specific topic.

Figure 5. Frequency of user types interacting with topicsa.

aIn , the frequency is calculated through a count of topic categories observed across the different types of users.

Figure 5. Frequency of user types interacting with topicsa.aIn Figure 5, the frequency is calculated through a count of topic categories observed across the different types of users.

Primary Companies

A list of media companies was analyzed, including the Guardian, BBC, Seattle Times, FranceTV, NRKO, and more. The top five media companies with more repositories are located in North America and Western Europe. For example, the Guardian had 1,823 repositories, followed by the BBC with 1,008, both companies located in the United Kingdom. The companies with the most repositories were Seattle Times (n = 453) and Los Angeles Times (n = 207), both located in the United States. Finally, the fifth media company with the most repositories was Naver (n = 207), a South Korean online platform. The OSF (Open Science Framework) provides a comprehensive list of the number of repositories for the extracted media companies, which can be accessed through the following link: t.ly/c4P7 ().

Table 2. Top media companies with repositories.

Moreover, we have mapped the top ten media companies with the most frequently stargazed projects (as shown in ). Yahoo News, The New York Times, and the Naver are the leading media companies with the highest number of stargazed repositories. Notably, Yahoo News reaches almost 52,939 stargazed counts for their projects, whereas The New York Times has 44,382 stargazed times and Naver 26,714. Significantly, most of these media companies (e.g., Yahoo News, The New York Times, Bloomberg, The Onion) are based in the United States. At the same time, Naver is located in South Korea, and The Guardian and the BBC are based in the United Kingdom. We included a comprehensive list of the distribution of stargazers per organization and repository in the OSF link: t.ly/c4P7.

Primary Projects

The description of the projects served to classify the repositories. In total, we categorized 4,635 projects (i.e., repositories) (). From this categorization, we observed that 630 projects were related to the Guardian’s internal deployment of repositories: “repo-genesis.” Since the Guardian utilizes a web-based application to streamline their deployment on GitHub, it is accessible for everyone in the organization to deploy repositories and maximize their data collection efforts. The second largest repository category was data visualization, with 469 projects focused on creating graphics, plots, and interactive visualizations. Another 449 projects were related to web libraries in languages such as Python and JavaScript, while 396 projects dealt with API development and 322 with website and frontend components. An additional 313 projects focused on news topics such as climate change, elections, and crime rates. From these projects, we also analyzed the most common programming languages. Since the website and frontend components were a common topic in the repositories, the most common programming languages are JavaScript (n = 2,659), HTML (n = 975), and Python (n = 818).

In addition, we examine the distribution of repositories across our sample. Most projects are developed in the United States (26.9%, n = 1,963) and the United Kingdom (38.6%, n = 2,821). In this regard, all the projects’ topics are covered in these countries (the United States and the United Kingdom), including emerging themes such as machine learning and the development of conversational bots. Interestingly, countries like Uruguay or Lithuania do not seem to have repositories under those categories.

Primary Actors

We extracted 15,726 unique actors. We categorize the users using actors’ biographies. We coded 14,080 profiles into five groups: journalists, developers, scientists, entrepreneurs, and graphic designers. This approach resulted in 3,331 journalists, 8,577 developers, 327 scientists, 1,689 entrepreneurs, and 156 graphic designers from 15,726 users. The remaining users (1,646) were only considered for an overview of contributions since the descriptions of their bios were not enough to categorize them. From this sample, we estimated the percentage of contributions by the type of user. We observed that from 21,377 contributions, 43% of the contributions were done by developers (n = 9,192), 11.66% (n = 2,493) by entrepreneurs, 7.33% (n = 1,569) by journalists, 3.48% (n = 744) were made by scientists, and 0.5% (n = 109) graphic designers. In our sample, we observed that, on average, each media company had 101 contributors. However, on average, users only contributed .019 to the different repositories across the media companies. The total number of contributors per media company can be found in our OSF: t.ly/c4P7.

Projects and Collaborations

Analyzing user interaction types, we see that developers are the primary contributors to all repository themes (e.g., web libraries, website and frontend components, and data visualizations). Journalists contribute mainly to data visualization projects or news coverage topics. Surprisingly, although data visualization projects are common, graphic designers are not the main contributors to this topic. Developers, entrepreneurs, scientists, and journalists interact more with data visualization repositories than graphic designers (i.e., DataViz). In contrast, journalists are subscribers of the Guardian’s public repository development, using the newsroom template to create their projects (). Moreover, since GitHub is also used for learning, also shows that contributions to documentation and learning projects are usually made by journalists and entrepreneurs, in addition to developers.

By exploring the contributions of journalists for each country (), we see that journalists mainly contribute to projects in the United States (US) (n = 58), the United Kingdom (n = 26), Australia (n = 10), Canada (n = 10) and New Zealand (n = 10). Interestingly, journalists mainly subscribe to projects in the United States (n = 41) and the United Kingdom (UK) (n = 20). Nevertheless, on average, users contribute only once to projects. Multiple interactions with the same repositories are not frequent.

Figure 6. Frequency of journalist interactions with media companies’ locationa.

aIn , the frequency is calculated through the count of observed interaction types across user types and location.

Figure 6. Frequency of journalist interactions with media companies’ locationa.aIn Figure 6, the frequency is calculated through the count of observed interaction types across user types and location.

Furthermore, an analysis of contributions by country of origin is presented in . The top ten contributors include developers from the United States (n = 888), United Kingdom (n = 386), Germany (n = 209), France (n = 141), and Canada (n = 103), as well as Sweden (n = 79) and the Netherlands (n = 76). Additionally, contributions from entrepreneurs in the United States (n = 292) and the United Kingdom (n = 107) and scientists in the United States (n = 115) are also among the top contributors. It is important to emphasize that while GitHub does offer a platform for collaboration across news organizations, the data indicates that contributions to repositories typically originate from specific users within each organization. Despite the potential for collaboration, users predominantly focus on their respective news organizations and may not contribute to other organizations as frequently as initially anticipated, as illustrated in .

Figure 7. Top ten frequencies of contributions by users’ locationa.

aIn , the frequency is calculated through the count of observed contributors (a specific type of interaction) across user types and locations.

Figure 7. Top ten frequencies of contributions by users’ locationa.aIn Figure 7, the frequency is calculated through the count of observed contributors (a specific type of interaction) across user types and locations.

Figure 8. Top ten contributors to repositories in different media companies.

Figure 8. Top ten contributors to repositories in different media companies.

Furthermore, our observations reveal that most users are employed by the news organization with which they interact. This finding underscores the close association between users and their affiliated news organizations, likely influencing their contribution patterns. For a more comprehensive understanding of the relationship between news organizations and user affiliation, we included a dataset with detailed information on the OSF link: t.ly/c4P7.

Discussion

As journalists increasingly rely on and report about digital data, they rely on trading zones to find collaborators to analyze, interpret, and display the data, helping to alleviate the burden of information asymmetry (Gray, Chambers, and Bounegru Citation2012). Recent literature shows that a new collaborative coding culture has emerged from these practices, marked by cross-disciplinary cooperation between media professionals, transparency in using codes and scripts, and the accessibility of large data sets. However, our results complicate the idea of collaboration in coding cultures by showing different degrees of reciprocity or cooperation in the information flow on GitHub’s repositories.

This article aimed to study how media organizations across a range of countries use GitHub as a platform for collaboration. Focusing on GitHub was not an accident. Previous studies have shown the potential behind this platform to promote cooperation between actors with different professional backgrounds and skills (Zagalsky et al. Citation2015). Our results suggest that GitHub is playing the role of trading zone for journalistic work. Professionals with different backgrounds and skill sets come to this platform to engage in the production and consumption of coding materials. As we have described, developers are the primary actors in all the different projects contributing to GitHub repositories owned by media corporations. In line with previous literature (Brossi and Dodds Citation2019; Lewis and Usher Citation2014), we see a diversification of professional roles in news development. Developers as the main contributors might result from more familiarity with the platform, which coders and data scientists primarily use.

Aligned with these findings, users mainly contribute to the same news organizations they are employed by. In this sense, users’ familiarity with their respective organization’s inner infrastructure plays an essential role in driving their interactions with the repositories. Their familiarity likely fosters a sense of ease and efficiency in contributing to projects within their organization. As a result, although GitHub can function as a trading zone platform by bringing together different types of professionals, collaboration across different organizations may be less common due to users’ strong association with their employing organizations.

These results put tension on the ‘collaborative’ part of coding cultures. These findings suggest limited cross-organizational collaboration, particularly between countries of different geographical regions. Instead, we see a notable trend toward intra-collaboration, with journalists predominantly engaging in collaborative efforts with their own organizations and repositories.

Our results show that GitHub, a platform that has experienced exponential growth since its foundation in 2008, is not alleviating the burden of smaller low-budget newsrooms that cannot gather or analyze large data sets and bridge journalists’ work and developers’ skills. However, the directionality of information flow seems to benefit smaller low-budget newsrooms, as they consume repositories more than they collaborate with them. To test how news organizations use GitHub, we extracted metadata from news organizations’ GitHub accounts (e.g., descriptions, coding languages, stargazer counts, followers). In addition, we identified the most popular media organizations based on the stargazed counts of their projects and sampled the users’ accounts contributing to them.

Regarding our main research question, namely, how news organizations use the code-sharing platform GitHub, we found that those organizations in the United States host the projects with the highest stargaze counts. We found that media outlets in North America, Western Europe, and South Korea have the highest number of repositories and interactions. Although media organizations assisting other media organizations in news production is a positive outcome, future research needs to study if the reliance on newsrooms in North America and Western Europe’s repositories eventually translates into new self-owned projects for newsrooms in other countries around the world or this remains a dependent relationship between one and the other.

Our results showed that GitHub mainly fulfills reporters’ need for the graphical display of big data sets (Gray, Chambers, and Bounegru Citation2012). Following recent literature, the collaboration emerges as the need to adapt various journalistic content to the latest platforms utilized by media organizations for news dissemination (Royal and Kiesow Citation2021). Surprisingly, one of the main themes was how The Guardian facilitates the deployment of repositories among the organization to maximize the collected data points. Interestingly, this finding points out the need for analyzing and reporting the abundance of public interest data. As Zhang and Ho (Citation2022) asserted, the trading zones show an interconnection between physical and digital settings. In this scenario, news organizations require not only a collaborative coding culture to analyze the growing number of data points (Gray, Chambers, and Bounegru Citation2012) but also to produce additional information that aligns with the organization’s primary objective of keeping the public informed (Hellmueller, Cheema, and Zhang Citation2017).

Future research should explore the relationship between media cooperation size, turnover, and their ability to contribute to collaborative data journalism. Our data indicates that many media companies among the top contributors to GitHub are large, successful corporations. However, in the context of media market concentration and “winner takes all” dynamics that have characterized the news media market in recent years, it remains an open question whether sharing the GitHub repositories serves as an equalizer or is, in reality, more likely to be a transparency tool that is costly to create and therefore further contributes to the economy of scale, thus perpetuating media asymmetries and cultural dependencies between news organizations on a global scale.

Finally, our results suggest that trading zones promote the necessary diversification of professional roles inside newsrooms, bridging the gap in educational backgrounds and helping small-budget newsrooms overcome workforce limitations and shortcomings caused by economic conditions. Perhaps these financial shortcomings help explain the imbalance in the “collaborative” part of coding cultures. As we posit at the beginning of this article, trading zones are both virtual and physical spaces, and the limitations of one (e.g., limited technological infrastructure in a low-budget newsroom) could affect their ability to produce coding repositories. Yet, collaborative (coding) cultures in journalism demand not an equal amount of supply but rather a nuanced understanding of how diverse skill sets, perspectives, and local expertise can, in a symbiotic relationship, benefit those proficient in coding and those who are still learning.

Conclusion

Based on our data, our results suggest that major English-language media outlets mainly produce emerging journalistic trading zones on the GitHub platform. Therefore, our results reveal an ongoing asymmetrical relationship between news organizations dominated by well-resourced media giants in North America and Western Europe. However, our data also show a great deal of diversity—the act of sharing digital resources, scripts, data, and learning content that connects a variety of contributors from newsrooms worldwide. Furthermore, media organizations in developing countries interact with the repositories of media companies in one way or another—to some extent, they speak the same language and share specific interests. From this perspective, the idea of a ‘global network of journalism’ seems to be reinforced (Berglez and Gearing Citation2018).

Future studies could start here and thus overcome one of the significant limitations of this study: Our initial media sample already includes most countries from North America and Western Europe. The focus on the ‘most robust’ media systems—the highest-ranking countries in the Press Freedom Index—seems plausible if the objective is to sample those countries with particularly active participants in the trading zone around media technology. Though this was, in fact, one of the aims of this study, it also meant that countries in Central and South America, Africa, and other regions of the world had been underrepresented from the very beginning. Moreover, the statements made here are based on a small and heterogeneous sample – raising the question of whether developing countries will reflect more diversity in the interaction between newsroom organizations worldwide. In any case, future studies should choose a more balanced country sample or one with the opposite constellation, for example, with the US and UK as the only representatives of North America and Western Europe. Since language, in particular, plays an essential role in adopting technologies (e.g., conversational bots), the digital divide often runs along the gap between low-resource and high-resource languages; it would be interesting to study English-speaking countries outside North America and Western Europe.

Furthermore, the analysis of topics on GitHub reveals a strong presence of the Guardian deployment system without offering insights into the range of topics the organization covers. Future research should broaden the representation of topics by employing the readme files as the unit of analysis.

Another limitation of the initial sample is that it consists exclusively of the GitHub accounts of traditional media companies. It is likely—and our data suggested so—that specific networks of actors that address media technology are not visible here. Future studies should focus on institutions beyond traditional mainstream journalism, such as smaller online-only media, news aggregators, research collectives, NGOs, individual reporters’ accounts, etc.

Qualitative studies could lay a foundation for this selection, identifying key actors and collaboration structures via interviews. A mixed-method approach could, for example, be used to identify critical stakeholders as potential interviewees on GitHub and to contact them via the platform. This would also answer whether GitHub is the primary platform for exchanging digital resources in all countries or whether trading zones can also be found elsewhere.

At the same time, our results suggest that the trading zone we considered also attracts actors whose connection with journalism is marginal at best. On the GitHub platform, journalistic interests overlap and interact with commercial and scientific interests. Future research could investigate how specific resources (e.g., individual scripts) transcend the boundaries of their original context and are recycled in other environments or adapted to different system logic.

Our study not only advances the current understanding of journalistic trading zones but also provides a roadmap for future research directions that can contribute to a more comprehensive and inclusive exploration of media technology landscapes globally. The intricate interplay of diverse actors and interests on GitHub underscores the evolving nature of journalism in the digital age, urging scholars to explore beyond traditional boundaries, contributing to a better understanding of the cross-pollination of ideas and practices within and beyond the journalistic sphere.

Disclosure Statement

No potential conflict of interest was reported by the author(s).

References

  • Alfter, B. 2019. Cross-Border Collaborative Journalism: A Step-By-Step Guide. London: Routledge.
  • Alfter, B., and S. Cândea. 2019. “Cross-Border Collaborative Journalism: New Practice, New Questions.” Journal of Applied Journalism & Media Studies 8 (2): 141–149. https://doi.org/10.1386/ajms.8.2.141_1
  • Appelgren, E., C.-G. Lindén, and A. van Dalen. 2019. “Data Journalism Research: Studying a Maturing Field across Journalistic Cultures, Media Markets and Political Environments.” Digital Journalism 7 (9): 1191–1199. https://doi.org/10.1080/21670811.2019.1685899
  • Ausserhofer, J., R. Gutounig, M. Oppermann, S. Matiasek, and E. Goldgruber. 2020. “The Datafication of Data Journalism Scholarship: Focal Points, Methods, and Research Propositions for the Investigation of Data-Intensive Newswork.” Journalism 21 (7): 950–973. https://doi.org/10.1177/1464884917700667
  • Bebawi, S. 2016. “Issues and Challenges.” In Investigative Journalism in the Arab World: Issues and Challenges, edited by S, Bebawi, 120–149. UK: Palgrave Macmillan. https://doi.org/10.1057/9781137461049_6
  • Berglez, P., and A. Gearing. 2018. “The Panama and Paradise Papers. The Rise of a Global Fourth Estate.” International Journal of Communication 12: Article 0.
  • Bossio, D., and J. L. Nelson. 2021. “Reconsidering Innovation: Situating and Evaluating Change in Journalism.” Journalism Studies 22 (11): 1377–1381. https://doi.org/10.1080/1461670X.2021.1972328
  • Boumans, J., and D. Trilling. 2016. “Taking Stock of the Toolkit: An Overview of Relevant Automated Content Analysis Approaches and Techniques for Digital Journalism Scholars.” Digital Journalism 4 (1): 8–23. https://doi.org/10.1080/21670811.2015.1096598
  • Boyles, J. L. 2020. “Deciphering Code: How Newsroom Developers Communicate Journalistic Labor.” Journalism Studies 21 (3): 336–351. https://doi.org/10.1080/1461670X.2019.1653218
  • Brossi, L., & Dodds, T. (Eds.). 2019. Visualización de Datos: Periodismo y Comunicación en la Era de la Información Visual. Año: Editorial Universitaria.
  • Broussard, M., and K. Boss. 2018. “Saving Data Journalism.” Digital Journalism 6 (9): 1206–1221. https://doi.org/10.1080/21670811.2018.1505437
  • Bunce, M., K. Wright, and M. Scott. 2018. “Our Newsroom in the Cloud’: Slack, Virtual Newsrooms and Journalistic Practice.” New Media & Society 20 (9): 3381–3399. https://doi.org/10.1177/1461444817748955
  • Carson, A., and K. Farhall. 2018. “Understanding Collaborative Investigative Journalism in a “Post-Truth” Age.” Journalism Studies 19 (13): 1899–1911. https://doi.org/10.1080/1461670X.2018.1494515
  • Cueva Chacón, L. M., and M. Saldaña. 2021. “Stronger and Safer Together: Motivations for and Challenges of (Trans)National Collaboration in Investigative Reporting in Latin America.” Digital Journalism 9 (2): 196–214. https://doi.org/10.1080/21670811.2020.1775103
  • Curry, A. L., and N. J. Stroud. 2021. “The Effects of Journalistic Transparency on Credibility Assessments and Engagement Intentions.” Journalism 22 (4): 901–918. https://doi.org/10.1177/1464884919850387
  • Dabbish, L., C. Stuart, J. Tsay, and J. Herbsleb. 2012. “Social Coding in GitHub: Transparency and Collaboration in an Open Software Repository.” Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work, 1277–1286. https://doi.org/10.1145/2145204.2145396
  • Davis, R. C. 2015. “Git and GitHub for Librarians.” Behavioral & Social Sciences Librarian 34 (3): 158–164. https://doi.org/10.1080/01639269.2015.1062586
  • Dodds, T. 2019. “Reporting with WhatsApp: Mobile Chat Applications’ Impact on Journalistic Practices.” Digital Journalism 7 (6): 725–745. https://doi.org/10.1080/21670811.2019.1592693
  • Dodds, T. 2021. “Structures of Resistance: Citizen-Generated Reporting in Times of Social Unrest.” In Politics of Disinformation, 119–131. Hoboken, NJ: John Wiley & Sons, Ltd. https://doi.org/10.1002/9781119743347.ch9
  • Engebretsen, M., H. Kennedy, and W. Weber. 2018. “Data Visualization in Scandinavian Newsrooms: Emerging Trends in Journalistic Visualization Practices.” Nordicom Review 39 (2): 3–18. https://doi.org/10.2478/nor-2018-0007
  • Fairfield, J., and H. Shtein. 2014. “Big Data, Big Problems: Emerging Issues in the Ethics of Data Science and Journalism.” Journal of Mass Media Ethics 29 (1): 38–51. https://doi.org/10.1080/08900523.2014.863126
  • Flores-Saviaga, C., and S. Savage. 2021. “Fighting Disaster Misinformation in Latin America: The #19S Mexican Earthquake Case Study.” Personal and Ubiquitous Computing 25 (2): 353–373. https://doi.org/10.1007/s00779-020-01411-5
  • Galison, P. 1997. Image and Logic: A Material Culture of Microphysics. Chicago: University of Chicago Press.
  • Gonzalez, D., T. Zimmermann, and N. Nagappan. 2020. “The State of the ML-Universe: 10 Years of Artificial Intelligence & Machine Learning Software Development on GitHub.” Proceedings of the 17th International Conference on Mining Software Repositories, 431–442. https://doi.org/10.1145/3379597.3387473
  • Gray, J., L. Chambers, and L. Bounegru. 2012. The Data Journalism Handbook: How Journalists Can Use Data to Improve the News (1st Ed.). Beijing: O’Reilly Media.
  • Grootendorst, M. 2022. BERTopic: Neural topic modeling with a class-based TF-IDF procedure (arXiv preprint arXiv:2203.05794). https://arxiv.org/pdf/2203.05794.pdf.
  • Guo, L., and Y. Volz. 2021. “Toward a New Conceptualization of Journalistic Competency: An Analysis of U.S. Broadcasting Job Announcements.” Journalism & Mass Communication Educator 76 (1): 91–110. https://doi.org/10.1177/1077695820925306
  • Haim, M., and R. Zamith. 2019. “Open-Source Trading Zones and Boundary Objects: Examining GitHub as a Space for Collaborating on “News.” Media and Communication 7 (4): 80–91. https://doi.org/10.17645/mac.v7i4.2249
  • Heft, A. 2019. “The Panama Papers Investigation and the Scope and Boundaries of Its Networked Publics: Cross-Border Journalistic Collaboration Driving Transnationally Networked Public Spheres.” Journal of Applied Journalism & Media Studies 8 (2): 191–209. https://doi.org/10.1386/ajms.8.2.191_1
  • Heft, A. 2021. “Transnational Journalism Networks “from below”. Cross-Border Journalistic Collaboration in Individualized Newswork.” Journalism Studies 22 (4): 454–474. https://doi.org/10.1080/1461670X.2021.1882876
  • Hellmueller, L., S. E. Cheema, and X. Zhang. 2017. “The Networks of Global Journalism.” Journalism Studies 18 (1): 45–64. https://doi.org/10.1080/1461670X.2016.1215254
  • Kunert, J., J. Frech, M. Brüggemann, V. Lilienthal, and W. Loosen. 2022. “How Investigative Journalists around the World Adopt Innovative Digital Practices.” Journalism Studies 23 (7): 761–780. https://doi.org/10.1080/1461670X.2022.2033636
  • Larrondo-Ureta, A., and E.-M. Ferreras-Rodríguez. 2021. “The Potential of Investigative Data Journalism to Reshape Professional Culture and Values. A Study of Bellwether Transnational Projects.” Communication & Society 34 (1): 41–56. https://doi.org/10.15581/003.34.1.41-56
  • Lewis, N. P., and E. A. Nashmi. 2019. “Data Journalism in the Arab Region: Role Conflict Exposed.” Digital Journalism 7 (9): 1200–1214. https://doi.org/10.1080/21670811.2019.1617041
  • Lewis, S. C., and N. Usher. 2013. “Open Source and Journalism: Toward New Frameworks for Imagining News Innovation.” Media, Culture & Society 35 (5): 602–619. https://doi.org/10.1177/0163443713485494
  • Lewis, S. C., and N. Usher. 2014. “Code, Collaboration, and the Future of Journalism.” Digital Journalism 2 (3): 383–393. https://doi.org/10.1080/21670811.2014.895504
  • Lewis, S. C., and N. Usher. 2016. “Trading Zones, Boundary Objects, and the Pursuit of News Innovation: A Case Study of Journalists and Programmers.” Convergence: The International Journal of Research into New Media Technologies 22 (5): 543–560. https://doi.org/10.1177/1354856515623865
  • Loosen, W., J. Reimer, and F. De Silva-Schmidt. 2020. “Data-Driven Reporting: An on-Going (r)Evolution? An Analysis of Projects Nominated for the Data Journalism Awards 2013–2016.” Journalism 21 (9): 1246–1263. https://doi.org/10.1177/1464884917735691
  • Madrigal, A., and R. Meyer. 2018, October 18. How Facebook’s Chaotic Push Into Video Cost Hundreds of Journalists Their Jobs. The Atlantic. https://www.theatlantic.com/technology/archive/2018/10/facebook-driven-video-push-may-have-cost-483-journalists-their-jobs/573403/.
  • Mergel, I. 2015. “Open Collaboration in the Public Sector: The Case of Social Coding on GitHub.” Government Information Quarterly 32 (4): 464–472. https://doi.org/10.1016/j.giq.2015.09.004
  • Mutsvairo, B. 2019. “A New Dawn for the “Developing” World? Probing Data Journalism in Non-Western Societies.” In Data Journalism in the Global South, edited by B. Mutsvairo, S. Bebawi, and E. Borges-Rey, 1–20. Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-25177-2_1
  • Porlezza, C., and S. Splendore. 2016. “Accountability and Transparency of Entrepreneurial Journalism.” Journalism Practice 10 (2): 196–216. https://doi.org/10.1080/17512786.2015.1124731
  • Royal, C., and D. Kiesow. 2021. “From Boundary to Bridge and beyond: The Path to Professionalization of Product Roles in Journalism.” Journalism Studies 22 (11): 1546–1565. https://doi.org/10.1080/1461670X.2021.1944277
  • Russell, A. 2016. Journalism as Activism: Recoding Media Power (1st ed.). Cambridge: Polity.
  • Sambrook, R., C. Lewis, B. Alfter, N. Kayser-Bril, A. Koch, and J. Clements. 2018. Global teamwork: The rise of collaboration in investigative journalism. Reuters Institute for the Study of Journalism. https://ora.ox.ac.uk/objects/uuid:4a2d40ba-c7c3-482d-9f59-cc67b0c7f555.
  • Stalph, F., O. Hahn, and D. Liewehr. 2022. “Local Data Journalism in Germany: Data-Driven Reporting Amidst Local Communities and Authorities.” Journalism Practice 17 (9): 1882–1901. https://doi.org/10.1080/17512786.2021.2019089
  • Tsay, J. T., L. Dabbish, and J. Herbsleb. 2012. “Social Media and Success in Open Source Projects.” Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work Companion, 223–226. https://doi.org/10.1145/2141512.2141583
  • Wachs, J., M. Nitecki, W. Schueller, and A. Polleres. 2022. “The Geography of Open Source Software: Evidence from GitHub.” Technological Forecasting and Social Change 176: 121478. https://doi.org/10.1016/j.techfore.2022.121478
  • Weber, W., and H. Rall. 2012. “Data Visualization in Online Journalism and Its Implications for the Production Process.” 2012 16th International Conference on Information Visualisation, 349–356. https://doi.org/10.1109/IV.2012.65
  • Wiedemann, G., and C. Fedtke. 2022. “From Frequency Counts to Contextualized Word Embeddings.” In Handbook of Computational Social Science, 366–385
  • Wright, K., R. Zamith, and S. Bebawi. 2019. “Data Journalism beyond Majority World Countries: Challenges and Opportunities.” Digital Journalism 7 (9): 1295–1302. https://doi.org/10.1080/21670811.2019.1702472
  • Wu, S. 2022. “Asian Newsrooms in Transition: A Study of Data Journalism Forms and Functions in Singapore’s State-Mediated Press System.” Journalism Studies 23 (4): 469–486. https://doi.org/10.1080/1461670X.2022.2032802
  • Zagalsky, A., J. Feliciano, M.-A. Storey, Y. Zhao, and W. Wang. 2015. “The Emergence of GitHub as a Collaborative Platform for Education.” Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, 1906–1917. https://doi.org/10.1145/2675133.2675284
  • Zamith, R. 2023. “Open-Source Repositories as Trust-Building Journalism Infrastructure: Examining the Use of GitHub by News Outlets to Promote Transparency, Innovation, and Collaboration.” Digital Journalism 2023: 1–22. https://doi.org/10.1080/21670811.2023.2202873
  • Zhang, X., and M. Chen. 2022. “Journalists’ Adoption and Media’s Coverage of Data-Driven Journalism: A Case of Hong Kong.” Journalism Practice 16 (5): 901–919. https://doi.org/10.1080/17512786.2020.1824126
  • Zhang, X., and J. C. F. Ho. 2022. “Exploring the Fragmentation of the Representation of Data-Driven Journalism in the Twittersphere: A Network Analytics Approach.” Social Science Computer Review 40 (1): 42–60. https://doi.org/10.1177/0894439320905522