Prepared Database

From SONIVIS:Wiki

Jump to: navigation, search

On this page, we provide some preprocessed databases.

Contents

Beginning an Analysis with Wikiversity

About the Wikiversity project

Wikiversity is a project of the Wikimedia Foundation. The objective of Wikiversity is "to further the discovery and distribution of knowledge in a very natural way, by helping each other to learn.". You can find further information on the German or English websites.

If you have loaded the german Wikiversity data, the following categories are some of the more interesting ones for a WikiLink network, as they have than 50 nodes/articles:

  1. Vorlage:Babel-Sprache (55 nodes)
  2. Vorlage:Projektarbeit (60 nodes)
  3. Projekt:Mathematik_ist_überall/Medien (68 nodes)
  4. Fachbereich_Mathematik (73 nodes)
  5. Schulprojekt:Hallo_Rohstoff!/Miniwikipedia (73 nodes)
  6. Wikiversity (85 nodes)
  7. Kurs (109 nodes)

For interesting collaboration networks, try category "Fach:Physik" or namespace 106.

Size-Reduced version of Wikiversity

If it is you first time using SONIVIS please use this database.

As network and text analysis algorithms implemented in SONIVIS can be very time-intensive on big networks, you should use a small-scale network to begin your first steps with the tool.

We provide a size-reduced version of the precalculated wikiversity databases for this. Download:

An explanation howto use these databases you will find here.

Full Wikiversity data sets

If you are an experienced user, please use these databases.

If you have a concrete idea what to analyze in a wiki, begin to conduct your analysis ideas on a real data set.

We have prepared two complete data sets with unmodified data of the Wikiversity project:

Please note: Both databases need a very long processing time if you calculate all metrics by once. Before beginning an analysis, you should activate only the metrics that you need in the Metrics Preferences dialog. Additionally, you should limit your analyses by filtering the network by category. This can save you a lot of time.

(And by the way, there is a really helpful article that describes standard use cases when querying a MediaWiki database on MediaWiki.)

Further datasets

Enron

Articles

Weblogs

Misc

Personal tools