This year
High Scalability - High Scalability - Tumblr Architecture - 15 Billion Page Views a Month and Harder to Scale than Twitter
distributed… owning our data… etc. *sigh*Growing at over 30% a month has not been without challenges. Some reliability problems among them. It helps to realize that Tumblr operates at surprisingly huge scales: 500 million page views a day, a peak rate of ~40k requests per second, ~3TB of new data to store a day, all running on 1000+ servers.
2011
Tom Morris - I’m not an experience-seeking user, I’m a meaning-seeking human person
excited about on the web: having technology as a way for people to establish an educational, interactional feeling with the world around them, to hack the world, to hack their context, to have the web and to have data as another layer on top of the world.
Un web ouvert, décentralisé et indépendant | znarf (blog)
Après les standards du web et les microformats, Tantek Çelik a choisi de promouvoir une nouvelle chose: l’IndieWeb.
2010
A predictable web of data - the why of YQL - free book excerpt
2009
An introduction to Opera Unite - Opera Developer Community
In a nutshell, Opera Unite is a collaborative technology that uses a compact server inside the Opera desktop browser to share data and services. You can write applications — in the form of Opera Unite Services — that use this server to serve content to other Web users.
Privacy Diffusion on the Web: A Longitudinal Perspective | Semantic Web Dog Food
or the last few years we have studied the diffusion of private information about users as they visit various Web sites triggering data gathering aggregation by third parties. This paper reports on our longitudinal study consisting of multiple snapshots of our examination of such diffusion over four years. We examine the various technical ways by which third-party aggregators acquire data and the depth of userrelated information acquired. We study techniques for protecting against this privacy diffusion as well as limitations of such techniques. We introduce the concept of secondary privacy damage. Our results show increasing aggregation of user-related data by a steadily decreasing number of entities. A handful of companies are able to track users' movement across almost all of the popular Web sites. Virtually all the protection techniques have significant limitations highlighting the seriousness of the problem and the need for alternate solutions.
Conferences Web of Data - semanticweb.org
current state-of-the-art regarding the handling of conference metadata and possible future developments.
Using the Web as our Content Management System on the BBC Music Beta - WWW2009 EPrints
In this paper, we describe the BBC Music Beta, providing a comprehensive guide to music content across the BBC. We publish a persistent web identifier for each resource in our music domain, which serves as an aggregation point for all information about it. We describe a promising approach in building web sites, by re-using structured data available elsewhere on the Web --- the Web becomes our Content Management System. We therefore ensure that the BBC Music Beta is a truly Semantic Web site, re-using data from a variety of places and publishing its data in a variety of formats.
Global Consciousness Project Dot - Correlated Structures in Random Data
Using the Semantic Web for Genealogy
(...) genealogy seems to be an obvious application of an RDF ontology and the Semantic web. I've investigated making use of RDF and the Semantic Web for Family History. The results of my investigation are here on this web page. In my work, I created a program to translate files in GEDCOM format to XML. I also wrote several stylesheets which translated the data into a new format GEDCOM XML, HTML, and RDF.
Evolution of the Web from 2000 to 2007 - average web object size quintuples since 2000
Summary: In a comparative survey of data traces from 2000 and 2007, University of Twente researchers found that the nature of the Web has changed from a static one-way medium to a dynamic platform for interactive services such as photo and video sharing portals.
The Web has changed dramatically over the past seven years. During that time the Web has moved from a static one-way medium toward a dynamic platform for interactive services such as photo and video sharing portals. In a comparative survey of data traces served over the Web from 2000 and 2007, University of Twente researchers found that the nature of web sites has changed (Sadre and Haverkort, 2008).
Data Visualization: Modern Approaches | Graphics | Smashing Magazine
2008
datatainment : play data ! play !
datatainment, is a portmanteau word combining “data” and (enter) “tainment” which refers to a new way of representing digital data related to the activities of individuals within the information world.
protobuf - Google Code
It's all about you - Carnets de La Grange
Le Web 2.0 a fonctionné sur le thème du « It's all about you » qui se résumait en fait à « It's all about your data for our advertisement incomes. »
très très justement décrit. Tant de considérer les services 2.0 comme périphériques et plus centraux et d'avoir un serveur perso par la même.










