public marks

PUBLIC MARKS with tags data & websemantique

2011

Official Google Blog: Introducing schema.org: Search engines come together for a richer web

by karlcow

introduces schemas for more than a hundred new categories, including movies, music, organizations, TV shows, products, places and more.

ironic how Google is finally getting the way of the old Yahoo!

2010

google-refine - Project Hosting on Google Code

by karlcow

Google Refine is a power tool for working with messy data, cleaning it up, transforming it from one format into another, extending it with web services, and linking it to databases like Freebase.

An RDF wishlist from Dan Brickley on 2010-07-01 (semantic-web@w3.org from July 2010)

by karlcow

The very nature of RDF makes it somewhat annoying to work with. RDF data is always going to be a kind of frankenstein's data monster, patched together from bits and pieces that can just about be made to fit together.

Provenance Vocabulary Core Ontology Specification

by karlcow

The Provenance Vocabulary provides classes and properties to describe the provenance of data from the Web. Hence, this vocabulary enables providers of Web data to publish provenance-related metadata about their data. The Provenance Vocabulary Core Ontology provides the main classes and properties required to describe provenance of data on the Web. Notice, this vocabulary is not designed to describe provenance of other kinds of content such as documents.

the wheel and the hub: societas hominum et societas rerum

by karlcow

And of course he has triggered the usual bunch of complaints about it. Tools are too technical, stuff is presented by geeks for geeks, data are boring, we need betteer user interfaces etc. Among many smart but technical proposals, basically adding to the general complexity issue they are supposed to solve, I will pick up this very simple one by Karl Dubost.

ACTION : Tell a story to people

linkedgeodata - Project Hosting on Google Code

by karlcow

LinkedGeoData is an effort to add a spatial dimension to the Web of Data / Semantic Web. LinkedGeoData uses the information collected by the OpenStreetMap project and makes it available as an RDF knowledge base according to the Linked Data principles. It interlinks this data with other knowledge bases in the Linking Open Data initiative.

2009

APIs and Lists from Jeni Tennison on 2009-12-13 (public-lod@w3.org from December 2009)

by karlcow

Dave (Reynolds) raised the point that lists are an integral part of most APIs. This is another thing that we know we need to address in the UK linked government data project, but are unsure as yet how best to do so.

What else? « Web of Data

by karlcow

The non-RDF bits of the data Web are – roughly – going to be the leaves on the tree.

The Third Bit » Blog Archive » Data Collaboration

by karlcow

He discusses this further in his post on influencing the production of public data. So let me throw it open: what do you want your city/county/province/national government to put online for you to play with, and why?

Stefano’s Linotype » On Data Reconciliation Strategies and Their Impact on the Web of Data

by karlcow

No matter what user interfaces will drive the user interaction, the dream of being able to search the web of data following relational connections (say, somehow looking for “the height of all towers located in Paris”) dies miserably when it’s powered by a vastly sparse and unconnected graph.

Turning the read-only Web of Data into a read-write Web of Data on Vimeo

by karlcow

We introduce pushback, a method that enables writing changes to non-RDF sources such as flickr, Twitter, Amazon, etc. from an RDF document.

The video explains our motivation, the architecture and the interaction between the components as well as RDForms. A demo (for Jira, a professional issue tracker system) is included in this video, where we show how to create, deploy and use an RDForm.

See esw.w3.org/topic/PushBackDataToLegacySources for further information.

How FriendFeed uses MySQL to store schema-less data - Bret Taylor's blog

by karlcow & 6 others

In particular, making schema changes or adding indexes to a database with more than 10 - 20 million rows completely locks the database for hours at a time.

Graphe, graphe, graphe !

~wingerz » Using Solvent to extract data from structured pages

by karlcow

There is a lot of structured data in web pages. While this data is usually backed by structured storage of some sort, a lot of the semantics of the data are lost by the time the page is rendered in the web browser.

Introduction to Information Retrieval

by karlcow

Apart from small differences (mainly concerning copy editing and figures), the online editions should have the same content as the print edition. However, we are planning to fix errata in the online editions every few months or so.

Google: "We're Not Doing a Good Job with Structured Data" - ReadWriteWeb

by karlcow

Google's Alon Halevy admitted that the search giant has "not been doing a good job" presenting the structured data found on the web to its users. By "structured data," Halevy was referring to the databases of the "deep web" - those internet resources that sit behind forms and site-specific search boxes, unable to be indexed through passive means.

2008

Messages in a bottle » Blog Archive » Descriptive markup and data integration

by karlcow

Bear in mind that data integration and aggregation (whether large-scale or small-) are intrinsically, necessarily, kinds of data reuse. No data reuse, no data integration.

2005

PUBLIC TAGS related to tag data

analytics +   google +   statistics +   stats +   web +  

Active users

karlcow
last mark : 03/06/2011 09:37