Mapping Energy Technology

Atakan Kara
Feb 15 · 22 min read

Protocol for data harvest from Wikipedia

The diagram shows the steps we applied to map the networks.

Network of “Energy technology”

Above is an annotated graph of Wikipedia pages on energy technology, connected by the Wikipedia links they share.

Distributed generation in the landscape of energy technology

Solar Power
Wind Energy
Hydro Power
Energy Storage

A network extracted through semantic analysis

Semantic analysis in CorTexT. Each node represents a word phrase. Thick triangles (nodes) stand for phrases with a high global count. An edge represents the co-occurrence of one phrase with another. Each cluster is labeled with the names of the three Wikipedia pages on which its phrases occur most often.
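The counting behind such a map can be illustrated with a minimal sketch. This is not CorTexT's own algorithm, only plain co-occurrence counting. The function name `cooccurrence` and the sample pages are hypothetical, and the sketch assumes that a phrase's "global count" is the number of pages it appears on.

```python
from collections import Counter
from itertools import combinations

def cooccurrence(docs):
    """docs: list of sets, each holding the phrases found on one page."""
    global_count = Counter()  # on how many pages each phrase appears
    pair_count = Counter()    # on how many pages two phrases appear together
    for phrases in docs:
        global_count.update(phrases)
        # Sort so that each pair is always stored in the same order.
        pair_count.update(combinations(sorted(phrases), 2))
    return global_count, pair_count

# Hypothetical sample input, not data from the article.
pages = [
    {"solar power", "energy storage"},
    {"solar power", "wind energy", "energy storage"},
    {"wind energy"},
]
global_count, pair_count = cooccurrence(pages)
```

Here `global_count` would drive node size and `pair_count` would become the edge list.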

Timelines for Wikipedia pages: “Renewable energy” & “Nuclear power”

“Renewable energy” revision history timeline between 2009–2019
“Nuclear power” revision history timeline between 2009–2019

Mapping the scientific controversy

Protocol for data harvest from Scopus

Protocol for data harvest


Nodes represent articles that link to other articles within our body of articles (the 4% most cited articles in each of the 16 most prevalent subject areas within renewable energy, see protocol). Node size is based on the overall count of received citations, both from papers within the body and from all other papers on Scopus. Small captions give the titles of the five most cited articles. Edges show the direction of citation to or from other articles. Unconnected nodes have been removed. Clusters with more than 12 nodes have been annotated.
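The construction described in this caption can be sketched roughly as follows. The function name `citation_network`, the article IDs, and the citation counts are all hypothetical. The sketch assumes each article comes with the set of works it cites and its total citation count.

```python
def citation_network(cites, total_citations):
    """cites: article id -> set of ids it cites (may point outside the body).
    total_citations: article id -> all citations received, from any paper."""
    corpus = set(cites)
    # Keep only directed edges whose target is also inside the body of articles.
    edges = [
        (src, dst)
        for src in sorted(cites)
        for dst in sorted(cites[src])
        if dst in corpus and dst != src
    ]
    # Drop unconnected nodes: keep only articles that appear in some edge.
    connected = {node for edge in edges for node in edge}
    sizes = {node: total_citations[node] for node in connected}
    return edges, sizes

# Hypothetical sample input, not data from the article.
cites = {"P1": {"P2", "X9"}, "P2": {"P3"}, "P3": set(), "P4": {"X1"}}
totals = {"P1": 10, "P2": 40, "P3": 120, "P4": 5}
edges, sizes = citation_network(cites, totals)
```

In this toy input, `P4` cites only a paper outside the body, so it ends up unconnected and is removed.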

Subject area-network based on coreference

Network based on articles from Scopus (the 4% most cited of the 16 most prevalent subject areas within renewable energy, see protocol above). Nodes represent articles; an edge between two articles means they share two or more references. Nodes are sized according to the number of edges they are connected to. Edges are weighted according to the number of references the two articles share, so a heavy edge means the two articles share many references.
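As a minimal sketch of how such a shared-reference network could be computed: the function name `coupling_network`, the `min_shared` parameter, and the sample reference lists are hypothetical, and the threshold of two shared references is taken from the caption.

```python
from itertools import combinations

def coupling_network(references, min_shared=2):
    """references: article id -> set of reference ids it cites."""
    edges = {}
    for a, b in combinations(sorted(references), 2):
        shared = len(references[a] & references[b])
        if shared >= min_shared:
            edges[(a, b)] = shared  # edge weight = number of shared references
    # Node size = number of edges the article is connected to.
    degree = {}
    for a, b in edges:
        degree[a] = degree.get(a, 0) + 1
        degree[b] = degree.get(b, 0) + 1
    return edges, degree

# Hypothetical sample input, not data from the article.
refs = {
    "A": {"r1", "r2", "r3"},
    "B": {"r2", "r3", "r4"},
    "C": {"r3", "r4", "r5"},
    "D": {"r9"},
}
edges, degree = coupling_network(refs)
```

Articles `A` and `C` share only one reference, so no edge is drawn between them, and `D` stays unconnected.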

The topology of renewable energies

Country-network based on coreference

Network based on coreference among articles from the 10 most publishing countries (the 10% most cited). Nodes represent articles, colored by country of origin (US: blue; China: orange; India: green; Germany: black). Node size equals the aggregated edge count. Edges represent shared references. Unconnected nodes and nodes that share fewer than two references with other articles are removed.

Network extracted through semantic analysis

A network built on a semantic analysis of articles containing the word “nuclear” in the abstract. Each cluster contains terms that are likely to represent a specific domain. The clusters are annotated according to our interpretation of the terms they contain.

Timeline of country publications on renewable energy

The bump chart above shows a timeline of different countries’ activity in the field of “renewable energy” between 1998 and 2018. It is based on our database, which compiles the top ten percent most cited articles published by these countries in that period. Each stream represents one of the ten countries in its own color. The area a country covers represents its publication count, and its placement from top to bottom depicts its rank by that count.
Aggregated count of publications on renewable energies by the 10 most publishing countries (the 10% most cited) between 1998 and 2018.
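The ranking step behind a bump chart can be sketched as below. The function name `yearly_ranks` is hypothetical and the counts are invented purely for illustration; they are not figures from our database.

```python
def yearly_ranks(counts):
    """counts: year -> {country: publication count}. Returns year -> {country: rank}."""
    ranks = {}
    for year, by_country in counts.items():
        # Highest count first; ties are broken alphabetically by country name.
        ordered = sorted(by_country, key=lambda c: (-by_country[c], c))
        ranks[year] = {country: i + 1 for i, country in enumerate(ordered)}
    return ranks

# Invented numbers for illustration only.
counts = {
    1998: {"US": 30, "Germany": 12, "China": 5},
    2018: {"US": 80, "Germany": 40, "China": 95},
}
ranks = yearly_ranks(counts)
```

Plotting each country's rank per year gives the top-to-bottom placement, while the raw counts set the width of each stream.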

