Posts by Year

2019

Back to Top ↑

2018

Amplifying Amplify

2 minute read

Amplify is a project maintained by the State Library of NSW for crowdsourcing the transcription of Oral History recordings. It is being used to generate tran...

Back to Top ↑

2017

Back to Top ↑

2016

Docker and MAUS

1 minute read

Today’s problem was to write a wrapper for the MAUS automatic segmentation system in preparation for including it as a Galaxy tool.   MAUS comes from Florian...

Mobile Apps for Aboriginal Languages

12 minute read

My introduction to Darwin was on a borrowed bike used to discover the streets around CDU and eventually making my way to the city and Midil Beach markets for...

Back to Top ↑

2015

Galaxy Tool Generating Dataset Collections

2 minute read

As part of the Alveo project we’ve been using the Galaxy Workflow Engine to provide a web-based user-friendly interface to some language processing tools. Ga...

Back to Top ↑

2011

Notes on Conversion of GrAF to RDF

9 minute read

The Graph Annotation Format (GrAF) is the XML data exchange format developed for the model of linguistic annotation described in the ISO Linguistic Annotatio...

DADA Project Update

2 minute read

The DADA project is developing software for managing language resources and exposing them on the web. Language resources are digital collections of language ...

Back to Top ↑

2010

Back to Top ↑

2009

Arduino & Physical Computing

1 minute read

I gave a talk last week introducing the Arduino platform to some MQ students and staff. It seemed to go well and there is a bit of interest in carrying on wi...

Using Robots to Teach Programming

less than 1 minute read

This is a project idea for an Honours student or similar. Please contact me if you’d like to follow this up.

Back to Top ↑

2008

A RESTful interface to Annotations on the Web

less than 1 minute read

Annotation data is stored and manipulated in various formats and there have been a number of efforts to build generalised models of annotation to support sha...

Sparql Endpoint for Python WSGI

1 minute read

As part of DADA (and yes, that page is a bit out of date) I wanted to provide a Sparql endpoint to allow experimentation with querying the raw RDF annotation...

Back to Top ↑

2007

Version Control for RDF Triple Stores

less than 1 minute read

RDF, the core data format for the Semantic Web, is increasingly being deployed both from automated sources and via human authoring either directly or through...

Screencasts in Teaching Web Technology

2 minute read

I’ve been using screencasts again this year in COMP249 (Web Technology) and have settled on a fairly stable way of producing them using Camtasia on Windows. ...

Welcome COMP249

1 minute read

This is just to welcome any COMP249 (Web Technology) students who might visit following my link from the lecture notes. You’re all welcome to look around at ...

The Machine is Us/ing Us

less than 1 minute read

Here’s an excellent video talking about text, hypertext, touching on the internals of HTML and XML and how Web 2.0 has changed the role of the reader. The we...

Back to Top ↑

2006

SCOPE

1 minute read

So today I make my TV debut! A few weeks ago a film crew from Channel 10 came to shoot a segment for the CSIRO/Channel 10 kids science show SCOPE. The episod...

Transcribed Podcasts and Audio Books

less than 1 minute read

John Udell is taggins some of his del.icio.us links to podcasts with transcriptavailable, transcripts have been generated manually. This could be a nice sour...

All in the Family

less than 1 minute read

My brothers are way more productive than me when it comes to generating cool websites. Via various routes we’ve all ended up working on the web, Patrick on w...

Speaker Tracking In Meetings

less than 1 minute read

This is a potential project idea for an Honours or Masters student. It might also form the core of a PhD project.

Annotation - Spoken Word Services

less than 1 minute read

Annotation - Spoken Word Services is another project that is providing web based annotation of audio recordings, this time in a learning environment.

Back to Top ↑

2005

BBC Annotatable Audio

less than 1 minute read

Tom Coates describes a currently internal BBC intitative to have everyone annotate audio content flikr style. This is a very cool application and is like a r...

What’s he building in there?

less than 1 minute read

What's he building in there? What the hell is he building In there? He has subscriptions to those Magazines... He never waves when he goes by He's hiding som...

Back to Top ↑

2004

Tcl Matrix Type

1 minute read

I’ve just implemented a matrix object type for Tcl, the sources are available here (matrix0.1-src.zip). The package implements a new object type for Tcl whic...

Gnowsis and RDF Desktop Systems

1 minute read

Gnowsis is a Semantic Web desktop System which means it aggregates various bits of personal data into an RDF store and provides a browser for the store. It i...

Semantic Web Talk

1 minute read

I gave a talk last night at the MQ Technology Trends seminar series on the semantic web, my slides are here for those who wanted them.

Bloglines Thinks I’m Italian

less than 1 minute read

So I’ve just started using Bloglines as a blog aggregator and it’s working well thanks to the Firefox plugin that makes subscription and seeing updates easy....

Putting Page Numbers in PDF

1 minute read

One of the annoying things that needs doing when organising a conference is to produce the proceedings. These days that means generating a CDROM filled with ...

SpeechBot

less than 1 minute read

SpeechBot is a is a search engine for audio & video content that is hosted and played from other websites. Recordings are indexed via speech recognition ...

Another Giggle User

less than 1 minute read

So now there are two giggle users (to my knowledge) since I’ve encouraged James to keep a blog of how his research project on Topic segmentation in meetings ...

Back to Top ↑

2003

More RDF Query/Path Stuff

less than 1 minute read

The RDF query/manipulation proposals are coming out of the woodwork on www-rdf-rules:

Political Persuasion

less than 1 minute read

Here’s an interesting test to while away a few minutes. According to the The Political Compass I’m a Leftist Libertarian, just like Ghandi, Mandela and the D...

RDF Path Languages

less than 1 minute read

The world is moving quickly towards defining a path language for RDF and maybe for other more general directed graphs. Here’s a few references:

The problem with being popular…

less than 1 minute read

While it’s nice being the number two ‘Steve Cassidy’ on Google (above the porn star but below the voiceover artist!) being well indexed can have it’s down si...

XTMPath — XPath for Topic Maps

less than 1 minute read

Robert Barta at Bond Uni has a paper on XTMPath, Manipulating Topic Map Data Structures which I should look at a little further. I enjoyed talking with Rober...

Atom REST API

less than 1 minute read

Mark Pilgrim describes his implementation of a REST API for Atom, the RSS successor being developed by various folk. This appeals to my URL designer sensibi...

Extreme Markup 2003: Day 1

6 minute read

Day 1 of the main conference saw an interesting range of papers from hard core modal logic applied to document markup to tips for making XSLT writing easier.

Extreme Markup 2003: Tutorial

less than 1 minute read

Having arrived in Montreal after an Airport Ordeal in Detroit and slept very well at the Bed and Breakfast I turned up at the Hilton a little late for Jonath...

IPAQ Video Conferencing

less than 1 minute read

We’re setting up a teleconferencing facility in order to collect data and experiment within our meeting room project. One of the problems is that we want to ...

W3C RDF Calendar Work

less than 1 minute read

A Working group in W3C is busy worrying about how to store and manage calendar data in RDF including converting the de-facto standard iCalendar format into R...

PDA Audio Recording

less than 1 minute read

Since I bought my Dell Axim I’ve been looking for audio recording and playback products, since the supplied Windows Media software is pretty basic. Because o...

NIST Speaker Recognition Evaluations

1 minute read

Looking at the 2003 NIST SR evaluations, while we’re too late to enter this year there is some useful data available, for example the Automatically Generated...

OSCOM

less than 1 minute read

An article on Advogato describes the efforts of OSCOM to unify Open Source content management systems. Mentions Twingle -

Tim Bray on Good Web Citizenship

less than 1 minute read

Tim Bray talks eloquently about what Apple could do to make their IMS service a good web citizen. Including:

Giggle

1 minute read

So this is Giggle, my weblog system which I’ve ripped off almost completely from Blosxom which is written in Perl. Why? Well, because I wanted to fiddle with...

JXPath

less than 1 minute read

JXPath - JXPath is a java api for traversing object graphs using an XPath like syntax. The collapse the notion of axis down to only ‘child’ which really beco...

Back to Top ↑

2002

RDF model vs. Syntax

less than 1 minute read

Don Box’s Spoutlet: My love affair with RDF began in 1999 when I had to prepare a a tutorial on XML metadata formats for XTech. My RDF love affair was with t...

Children’s Books

less than 1 minute read

The International Children’s Digital Library Has 200 children’s books scanned for public access. I can’t see it because of the Java requirement though 🙁

REST

less than 1 minute read

XML-RPC case study

Zero Install

less than 1 minute read

Don Park’s Blog say’s the net needs zero install extensible client platforms which .Net and java webstart aren’t. perhaps CANTCL can be something like that, ...

PIMs

less than 1 minute read

Open Source Applications Foundation - Vista prototype is another outlook killer, perhaps interesting this time as it’s based on an RDF database underneath an...

Overlapping trees in XML

less than 1 minute read

xmlhack: One tree isn’t enough talks about a couple of proposals for encoding overlapping trees.

Universities obsolete?

less than 1 minute read

In a thread on [Slashdot More on MIT OpenCourseWare](http://slashdot.org/article.pl?sid=02/09/22/1634209&mode=flat&tid=146) the ...

Moving to ICS

less than 1 minute read

So, I’ve finally forwarded my Home Page from SHLRC to ICS, which I guess means that after 9 months here I’ve finally arrived, or rather it means that I’ve no...

Tcler’s Dinner

less than 1 minute read

Had a great dinner chat with Jean-Claude Wippler and Dan Steffan. JC is the author of MetaKit, Starkit and many other interesting things and was visiting Sy...

e4graph

less than 1 minute read

Discovered e4Graph which is a generalised storage system for directed graphs in C++ using MetaKit as the backend and providing C++ and scripting interfaces.

XML and Java

less than 1 minute read

This book swings into my conciousness, first I find it while browsing for a new text for our XML course, then slashdot reviews it. They’re not entirely posit...

Jo Jo Laid an egg

less than 1 minute read

Our Silkie chicken Jo Jo laid her first egg today, joining Betty (an Australorp) who started last week. These chooks have been around for 7 months now withou...

Dasher, text input methods

less than 1 minute read

Dasher is a new input method which uses an ngram language model to predict the next letter you want to enter but lets you select it by ‘flying’ through regio...

The days run away…

less than 1 minute read

Here’s my first blog entry. The blog name is from a Charles Bukowski poem, I don’t so much identify with the content (as usual for him it’s about being drunk...

Arse licker…

less than 1 minute read

I’ll keep calling PM an arse-licker, vows Latham - smh.com.au…is why I like Australia.

Back to Top ↑