
coli-ana

coli-ana is a project aimed at the analysis of synthesized DDC numbers. The number analyzer (vz_day) is being developed as part of the VZG project colibri, of which coli-conc is also a sub-project. coli-conc plans to integrate the results of coli-ana into its infrastructure and to make information on decomposed DDC numbers available in different formats (JSKOS, MARC, PICA...). The results can be of great value for subject indexers, for concept mapping, and for the research community.

A tool to query and display analyzed DDC numbers is available for testing. So far it contains only a limited set of notations imported from vz_day:

Start coli-ana release version

Start coli-ana development version

Example

Given a synthesized DDC number, it can be hard to find out how the number was built. For instance, the DDC number 700.90440747471 contains the following DDC classes:

With coli-ana it is possible to look up a DDC number together with every DDC class it contains. The result can also be queried via an API in JSKOS data format and in other forms.
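As a rough illustration of working with such a result, the following Python sketch reads a JSKOS-shaped record and lists the classes it contains. It assumes the analysis is returned as a JSKOS concept whose components appear in the `memberList` field; that assumption, the function name `list_members`, and the sample record are not taken from the coli-ana documentation. The member entries in the sample are placeholders, not actual coli-ana output.

```python
import json

# Hypothetical, hand-written record shaped like a JSKOS concept.
# The member notations and labels are placeholders for illustration only;
# they are NOT the real decomposition produced by coli-ana.
SAMPLE = json.loads("""
{
  "notation": ["700.90440747471"],
  "memberList": [
    {"notation": ["700"], "prefLabel": {"en": "placeholder label A"}},
    {"notation": ["700.9"], "prefLabel": {"en": "placeholder label B"}}
  ]
}
""")


def list_members(concept, lang="en"):
    """Return (notation, label) pairs for each member of a JSKOS concept."""
    pairs = []
    for member in concept.get("memberList", []):
        if member is None:
            # Defensive: skip missing entries rather than fail.
            continue
        notation = (member.get("notation") or [""])[0]
        label = (member.get("prefLabel") or {}).get(lang, "")
        pairs.append((notation, label))
    return pairs


if __name__ == "__main__":
    for notation, label in list_members(SAMPLE):
        print(f"{notation}\t{label}")
```

In practice the record would come from the API response rather than from a literal string; the parsing step stays the same as long as the response follows the assumed shape.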

Documentation

A brief introduction to coli-ana was given in this Code4Lib 2021 lightning talk.

The primary application of coli-ana is to enrich the K10plus union catalog. Bibliographic records that carry a DDC number will be extended with the analysis results in PICA format.

More information about coli-ana can be found in the presentation Automatic Analysis of DDC Numbers based on MARC21 given by Ulrike Reiner at the EDUG Symposium 2016 and in the article Automatic Analysis of Dewey Decimal Classification Notations (2008). An earlier attempt to decompose DDC numbers was conducted by Songqiao Liu, see the article Decomposing DDC Synthesized Numbers (1996).