coli-conc Logo

coli-ana

coli-ana is a tool that analyzes and decomposes synthesized DDC numbers. The number analyzer (vc_day) has been developed as part of the VZG project colibri, of which coli-conc is also a sub-project. The results of coli-ana has been intergrated into the infrastructure of coli-conc and information on the decomposed DDC numbers are provided in different formats (JSKOS, MARC, PICA...). The results could be of a great value not only for subject indexers and for concept mapping, but also for the research community.

A tool to analyze any given synthesized DDC number is made available for trial purposes:

Start coli-ana release version

Start coli-ana development version

Example

For a given synthesized DDC number, it can be hard to find out, how the number was built. For instance, the DDC number 700.90440747471 contains the following DDC classes:

coli-ana enables to analyze the DDC number and determine every single DDC class that was used to build the number. The result can also be queried via an API in JSKOS data format and in various other formats.

Documentation

Forthcoming presentation on coli-ana at SWIB Conference 2021: Automatic Analysis of the Dewey Decimal Classification: A Service of the Verbundzentrale des GBV

A brief introduction to coli-ana was given at: in this Code4Lib 2021 lightening talk.

The primary goal of coli-ana is to enrich the K10plus union catalog. Bibliographic records with DDC number will be extended in PICA format.

More information about coli-ana can be found in the presentation Automatic Analysis of DDC Numbers based on MARC21 given by Ulrike Reiner at the EDUG Symposium 2016 and in the article Automatic Analysis of Dewey Decimal Classification Notations (2008). An earlier attempt to decompose DDC numbers was conducted by Songqiao Liu, see the article Decomposing DDC Synthesized Numbers (1996).

FAQ

What is the difference between the development and the release version?

The development version contains all the latest features and changes. It usually represents the most current status of development of the tool. However, this means that it might also contain bugs or other issues that haven't been fixed yet.

When the development version reaches a point where bugs are fixed and everything works well, it will be merged into the release version. Therefore, the release version is always a bit behind than the development version, but is more stable and less likely to break.

What is the difference between "analyze" and "lookup"?

In the coli-ana web interface, there are two main buttons: "analyze" and "lookup". "analyze" analyzes a given DDC number and shows how the number was built. This is usually performed on the fly and gives the latest analysis of the vc_day number analyzer.

"lookup", however, uses pre-analyzed DDC numbers of the GVK union catalog (soon to be updated to the latest data of the K10plus union catalog) and searches for a given number within those analyses. This means that a lookup will show the DDC notations which were used in the sythesis of one of the DDC built numbers. This currently has limited application since the result sets are usually very large and can't be fully explored in the interface.

I have got a different result for the same number. How can that happen?

There are two reasons why a result can differ from previous results:

  1. The vc_day number analyzer was updated and the analysis of the given number has changed.
  2. There is a problem in the backend and the result comes from a database cache (that potentially contains old results).

The database cache contains pre-analyzed numbers from the GVK union catalog and might not have used the current version of the number analyzer.

Where do I find details to each DDC class?

Details about each element of a built number are displayed on mouseover.

How do I access the rules applied in the analysis of a built number?

It is planned to display rules applied in the analysis of each built number.

How do I determine the base number of a given DDC number?

A base number of a DDC number is a number of any length to which other numbers are added. It is planned to highlight base numbers in the analysis.

What other features does the coli-ana web-service provide?

The webservice offers two other features:

  1. Deep links to the title data of the union catalog K10plus.
  2. A seamless transition to the mapping tool Cocoda.