Sam Leon - May 21, 2012 in Events, Workshop
Availability Monday at Goldsmiths for TEXTUS user testing workshop. Tuesday-Thursday C4CC. Friday at Natural History Museum for open digitisation project event.
- TEXTUS user testing workshop at Goldsmiths with Tom Oinn (Monday)
- Update Wiki for progress on all user stories
- Write up of user testing workshop at Goldsmiths
- Set up remote TEXTUS user testing for 100+ user testers who have signed up
- Organisation of reading group sessions to begin in June at Goldsmiths
- TEXTUS budget review with Jilly
- Open Digitisation Project event with Collections Trust and Open Right Group at Natural History Museum (Friday)
- Finish first in series of key concept in open cultural heritage metadata with Primavera for openglam.org and UK Discovery
- More #BiblioHack workshop promotion and invites
- Continue storyboarding for open licensing animation with Joris
- Edit and publish new guest post from The Archive Platform on openglam.org
- Find and book food and venue for meet-up before #BiblioHack
- User testing workshop planning
- Emails to university departments concerning user testing
- Open Humanities Working Group call with new volunteer Sarah plus discussion of open humanities presence at OKFest — see notes
- #BiblioHack workshop and hack organisation
- Open GLAM stream OKFest planning – calls, confirmation of venues, questions and preparation for 2nd call
- Challenges blog post on openglam.org
- PM handover calls and budget reviews with Jilly
- Organisation of Adivsory Board Meeting in Berlin
- Boosting OpenGlam handle on Twitter – follow us!
- Discussion with Wittgenstein Archives re: use of their data and documents in Hack4Europe in Berlin. Decision that data not ready.
- Setting up new email aliases
- Approving Histories of Open Knowledge Project – next steps to follow shortly
Rufus Pollock - June 6, 2011 in CKAN, Events, Workshop
Notes from the workshop.
- Set agenda and outline for the day
- Martin – software engineer. Interested in design and how government works.
- Chrastian – ontotext. Interested in open semantic data. http://www.ontotext.com/
- Elena – from Sofia University (teach Sociology). Teach course on content analysis. Excited that there is growing interested in public data. You can process a lot but need a purpose.
- Martin – OpenStreetMap’er. How can we integrate with other data e.g. missing people
- Ivo – works for ontotext
- Galia – just interested in open data
- Bogdаn – software author. Curiosity!
- Peio – legal adviser by day, IT background. Curious.
- Plamen – ex-software engineer. Aggregating data from bulgarian parliament.
- Alex – interested in using new technologies, electronics and music!
- Stoian Mishinev – IT Specialist
- Yana Petrova – journalism student
- Data (and problem) mapping
- Problems with getting data
- Tools for working with data and developing a community around it (using it)
Gov data mapping
- Civic info
- News / Gazettes
Government structure in Bulgaria
- Central Gov – executive and parliament and courts
- Regions (28)
- Municipalities (cities are sometimes municipalities by themselves)
- Districts (possibly)
- (mayors in smallest villages)
Legal status for gov material (e.g. legilslation) — ЗАПСП http://lex.bg/bg/laws/ldoc/2133094401 Член 4, точка 4 Не са обект на авторско право
Question: how far does this extend to all documents.
The law: in state gazette: mostly online (html and pdf)? http://dv.parliament.bg
4th tab link on: http://dv.parliament.bg/ (no direct link because no urls!)
Committee debates: http://www.parliament.bg/bg/parliamentarycommittees/members/226/steno
Plenary sessions debates: http://www.parliament.bg/bg/plenaryst
Have CKAN package: http://ckan.net/package/bg-budget
- Trains: Publicly owned
- Trams in sofia: Publicly owned
- Bus: part private / part public http://www.sofiatraffic.bg/
- Subway: publicly owned
Civic Info (Health, Education etc)
The company register was publicly available until 2011; at some point in 2011 it has been closed and access to it is available for a fee.
[ACTION: Peio – get old dump and analysis and add to relevant CKAN dataset]
Geodata and Cadastral
Problems getting data
- Gov objections to giving out data (and what can you do about it).
- Data format
- Data persistence
- Data quality
ACTION [Peio]: clarify scope of public domain provision for gov data (is this just legislation and gov documents or all gov data)
What do we do about PDF?
* Ask – directly or via http://isitopendata.org/
* Find a contact if you can
* Find out what the worries are …
* Find tools – http://getthedata.org/questions/339/excel-table-from-a-pdf
[ACTION: Rufus Pollock: ask Julian Todd to write up instructions on PDF parsing based on UNDemocracy experience]
Tools and Communities
- Transform (clean and integrate)
Proprietary but free (in some form or other):
- Google docs and google fusion tables
- Google refine
- Tableau, Needlebase …
Ideas / Wanted
- croudsourcing the collection of all the bulgarian legislative data
- extract structured info from plenary and committee debates
- list of municipalities
- http://wiki.openspending.org/Countries – find volunteers to populate data for the Bulgarian budget
- on time stats for public transport
- wifi locations
- ‘Tell me about my area’ — On my phone (on facebook even!)