A partial archive of https://score.community/ as of Monday March 04, 2024.

Taxonomy Manager – Concept & working group progress discussion

pjppauwels

LAB9K at Digipolis Ghent has been experimenting with creating a taxonomy tool for Open City Data. This has been a challenge for quite a while and it seemed this might be a decent meta solution to a lot of partner cities. The main idea is that this tool contains all the necessary taxonomies and relations between those to see how the open data is structured.

The tool itself is based upon the SKOS specification, defined by W3C https://www.w3.org/TR/2005/WD-swbp-skos-core-spec-20051102/ and shows

Atramhasis is an online SKOS editor. This web-application enables users to create SKOS vocabularies consisting of Concepts and Collections. It tries to stay close to the SKOS specification. This makes it suited for different types of vocabularies such as simple pick lists, authority files, flat lists and basic to relatively complex thesauri.

It’s not completely new, it’s based upon an open editor created by Onroerend Erfgoed Vlaanderen (EN: immovable(?) heritage Flanders)

First Ghent version: https://atram.lab9k.gent/
Existing codebase: https://github.com/OnroerendErfgoed/atramhasis
Documentation: http://atramhasis.readthedocs.io/

What do you think? Have an alternative at your city? Or best practice?
Let us know!

Kind regards,
Pieter-Jan

timvanachte

This working group proposal is one of the proposals from Ghent we have analysed more in depth + actualised using the “working group proposal” template.

Updated title: Linked Open Data Taxonomy Manager
See this doc in shared drive for additional information about the initiative.

(Background information on template used: Partnership Update: Overview of activity for September, October and November 2018)

timvanachte

At one of our final sessions of the meeting last week (WP next steps), the concept of linked open data was discussed. There seemed to be interest among several cities (Ghent, Bradford University, Aarhus, others?) for working on identifying relevant and useful solutions within the concept of linked open data in SCORE.

Therefore, allow me to bump this workgroup proposal from Ghent about a taxonomy manager for linked open data. Such a manager would facilitate a common vocabulary (terms + relations between these words) among different partner cities (cfr. the gritting / salting example). Its impact would potentially even reach beyond SCORE, as it would provide its resulting data and tools as a starting point for non-SCORE cities.

Especially when we came to one of the core questions: “Which kind of challenges are the ones that truly depend on the multi-city perspective and mutual benefits to be kickstarted?”, it are challenges such as these that come to mind.

Other option: in a working group we could ofcourse explore this linked open data challenge a bit broader, and identify other potential solutions based on the actual linked open data ambitions of all interested partners.

What does it do?

The tool allows cities to create and manage taxonomies, lists of words, including each word’s definition and how words relate to each other in the city context. Such taxonomies are typically used as exchangeable backend and frontend data structures for database indexing, retrieval, filtering, selection, navigation etc. The tool acts as a data framework, integrating very well with other open source systems and modules. However, and more importantly for the City of Ghent, the use of such taxonomies also enables city departments to provide their open data (privately and publicly) as linked data, enabling not only linking the different types of data that live within departments, but increasingly facilitating the exchange of data in between departments and even in between different cities or different levels of government. All of this is made possible because the data structure allows to define how different terms (and thereby different pieces of data) relate to each other in the city’s context.

What does it solve?

It allows for structure and unity in data formats among different city stakeholders (eg. different city departments, different cities, citizens, different backend and frontend technologies used, …) Dealing with this important obstacle, it facilitates collaborations among these stakeholders when they intend to exchange and provide data openly and link different pieces of data together.

turegjorup

@timvanachte Sounds interesting, and linked data is definably a topic we’ll be watching. But right now, we don’t have manpower to commit to development on this.

However, if you decide to work on this we might be able to participate as a “validating partner”, meaning we’ll provide feedback and test the solution.

timvanachte

Good to know @turegjorup, thanks. We’ll wait for other interested partners (30 days we agreed on for a reaction was it? :wink: ) in, indeed, the different possible roles: co-developer, replicator, tester, giver of feedback or just follower. I’ll let you know if an initiative related to linked data is starting.

dhaval

HI @timvanachte, indeed. Thanks for initiating this. We are interested in this initiative from the perspective of building on existing taxonomies/ontologies to help SCORE partners to publish or use more of linked data. I look forward to more discussion on this topic. Thanks Dhaval @rayemson

timvanachte

Hi @turegjorup and @dhaval

Based on your reactions here I took the liberty of setting your status (Aarhus and Bradford University) to ‘Yes, as follower’ for the Taxonomy Manager proposal in the solution proposal dashboard.

timvanachte

Ghent has decided to start developing this, in collaboration with the two follower partners. Of course anyone can still join later.

By working on this taxonomy manager we’ll also try to get partners as convinced as us that taxonomy management is a key element for enabling connections between open source components, data flows and eventually between cities.

@claus @Boris Can a working group be created on the community where we can kickstart this initiative? First step will be ofcourse to develop a work plan based on the template.

claus

@timvanachte done

timvanachte

The Taxonomy Tool was demo’ed and discussed by @Jefwillems and me at a working table at the partner meeting in Aarhus, together with @turegjorup @nicole.schubbe @Vbyrne, Kim Søvsø from Aarhus, and @JoranVD @Justine

Thank you all for joining and contributing to the results below:

Use cases/ideation

  • Identify where there is most need for more understandable, ‘common terminology’ towards citizens (structure information in a way that it makes sense “in daily life” rather than the way a local authority happens to be organized)
  • Align along smart cities up similar information structures we use in web portals, web sites, applications. Share (translated versions of) the taxonomies we use, adapt to each other’s insights. Provide more similar facet navigation in open data portals across countries. (Hamburg)
  • Develop a taxonomy for richer tagging of open datasets, so dataset/data portal responsibles can more efficiently handle repeatedly incoming questions on availability of data “Does our city have data on …?” Combine with the SCORE visualiser tool? (Hamburg, Ghent)
  • Align taxonomy filters/faceted navigation for open datasets, both as internal tagging as external public tagging use case (Aarhus, Hamburg, Ghent)
  • Improve (title/label) search in applications thanks to underlying taxonomy that is made ready to ‘capture’ and map common search terms to the right results

Use cases within SCORE:

  • Scan ongoing and emerging SCORE solutions on need for common terminology (one solution will be more suitable than the other, taxonomy should not be overkill but make life easier)
  • Keep possibilities of taxonomies in mind, spotting opportunities when challenge working groups go in ideation coming months

Barriers identified

  • One or more key users needed in your organisation that know how tool works, and combines this with coordinating the offline workflow (organise brainstorms, meetings, converging to the actual agreed taxonomy)
  • When to convince to use the tool and not traditional tools (eg Excel): at which point does it have added value to just using Google Sheets?
  • Scan our portfolio of applications and find a taxonomy complex enough to work on (At which point is a full blown taxonomy needed? At which point is structured open data valuable enough to efforts put in?)

Possible backlog items/open source contributions & features

  • Authentication layer/mechanism after ‘Admin’ click (to protect the data entered)
  • Create new taxonomy as user, not as coder
  • Import existing data structures (rework), eg. Hamburg open data portal, a) as user, not as coder b) import a static file, not URL