Looking for a way to automatically create meta data?
A new web service has launched which generates RDF for any text you send along. The semantic web is now one step closer to being something real and usable.
I'm still looking into it - looks like there are some Ruby and Java sample applications.
The Drupal post which alerted me to it:
http://groups.drupal.org/node/8801
Open Calais:
http://opencalais.mashery.com/Overview
There's a few anomalies with the current setup.
First, Sesame's memory store is truly stored in memory, and I access via both a custom app, and the standard REST interface. This produces concurrency issues, since the Sesame application stored for the REST interface doesn't seem to sync as often. Also, the console fails to catch this unless the url at http://localhost:8080/openrdf-sesame has its web container restarted.
These are all stored in memorystore.data. I plan to ask if sync is set to 0, how long does it take?
The module is being updated to make sure it works well in Java 6 and with the latest Sesame 2.0 release.
There are still a number of bugs with the pilot application. The application is being tested on OSX and will a launch on windows server 2003. The three major hurdles are
SONIA, when designed a year ago, was meant to be a set of three apis to simplify building PHP apps against various RDF stores. The concept has been proven to work - but a lot of work remains. Here's a rough UML sketch of that work from one year ago.

Semantic Search is free software licensed under the GPLv2. You can download the software and use it, as well as submit bug reports and feature requests, at http://drupal.org/project/semantic_search. The software is provided WITHOUT warranty and is currently alpha quality.
Some feature requests require significant development time, and require funding. A number of features are planned which would benefit from financial support. New features will be folded back into the open source code, so this support will benefit the Drupal community and the larger internet community.
You are looking at the speed of AHIRC front page loads now, after another 18 hour day. This is 3-12X faster than the last post!!

Looks like I'll be using a hybrid approach of the HTTP client, and a custom class running on JAVA to do the aggregation emulation.
Here's a recap of the consequences.
The first points of failure are (since the HTTP client for Sesame works great):
* SPARQL/Sesame not having Aggregate functions
* Sesame not having ORDER BY
This produces large amounts of results and/or queries, which then need to be parsed by JSON, leading to the second point of failure
* Zend JSON and native php 5.2 JSON are not fast enough (perhaps they should not be expected to be for 6000 results)
212 queries to load the front page of AHIRC. That'll do it.
This happens because I am trying to emulate the "COUNT" mixed with "GROUP BY" you'd find in SQL. One company has it:
http://docs.openlinksw.com/virtuoso/rdfsparqlaggregate.html
And looks like others are anxious:
http://simile.mit.edu/mail/ReadMsg?listId=14&msgId=16326
But it's not an easy problem in a broader context:
http://www.ldodds.com/wordtin/Wiki.jsp?page=UniqueNamesAssumption
The client is nearing completion. I'm working on create, update, and Delete. I'll post a version of the file on this site when done
Thanks for the help from Sesame's project leader in the following forum thread.
http://www.openrdf.org/forum/mvnforum/viewthread?thread=1417