public marks

PUBLIC MARKS with tags data & web

This year

High Scalability - High Scalability - Tumblr Architecture - 15 Billion Page Views a Month and Harder to Scale than Twitter

by karlcow

Growing at over 30% a month has not been without challenges. Some reliability problems among them. It helps to realize that Tumblr operates at surprisingly huge scales: 500 million page views a day, a peak rate of ~40k requests per second, ~3TB of new data to store a day, all running on 1000+ servers.

distributed… owning our data… etc. *sigh*

2011

Tom Morris - I’m not an experience-seeking user, I’m a meaning-seeking human person

by karlcow

excited about on the web: having technology as a way for people to establish an educational, interactional feeling with the world around them, to hack the world, to hack their context, to have the web and to have data as another layer on top of the world.

Un web ouvert, décentralisé et indépendant | znarf (blog)

by Monique

Après les standards du web et les microformats, Tantek Çelik a choisi de promouvoir une nouvelle chose: l’IndieWeb.

2010

A predictable web of data - the why of YQL - free book excerpt

by ghis
Great introduction to YQL (Yahoo Query Language), a query language for easily integrate APIs.

2009

An introduction to Opera Unite - Opera Developer Community

by karlcow

In a nutshell, Opera Unite is a collaborative technology that uses a compact server inside the Opera desktop browser to share data and services. You can write applications — in the form of Opera Unite Services — that use this server to serve content to other Web users.

Write Web Of Data - ESW Wiki

by karlcow

idea of realizing a write-enabled Web Of Data.

Privacy Diffusion on the Web: A Longitudinal Perspective | Semantic Web Dog Food

by karlcow

or the last few years we have studied the diffusion of private information about users as they visit various Web sites triggering data gathering aggregation by third parties. This paper reports on our longitudinal study consisting of multiple snapshots of our examination of such diffusion over four years. We examine the various technical ways by which third-party aggregators acquire data and the depth of userrelated information acquired. We study techniques for protecting against this privacy diffusion as well as limitations of such techniques. We introduce the concept of secondary privacy damage. Our results show increasing aggregation of user-related data by a steadily decreasing number of entities. A handful of companies are able to track users' movement across almost all of the popular Web sites. Virtually all the protection techniques have significant limitations highlighting the seriousness of the problem and the need for alternate solutions.

Conferences Web of Data - semanticweb.org

by karlcow

current state-of-the-art regarding the handling of conference metadata and possible future developments.

Using the Web as our Content Management System on the BBC Music Beta - WWW2009 EPrints

by karlcow

In this paper, we describe the BBC Music Beta, providing a comprehensive guide to music content across the BBC. We publish a persistent web identifier for each resource in our music domain, which serves as an aggregation point for all information about it. We describe a promising approach in building web sites, by re-using structured data available elsewhere on the Web --- the Web becomes our Content Management System. We therefore ensure that the BBC Music Beta is a truly Semantic Web site, re-using data from a variety of places and publishing its data in a variety of formats.

Global Consciousness Project Dot - Correlated Structures in Random Data

by ycc2106
Webpage add-on button that changes color The Global Consciousness Project collects random numbers from around the world. These numbers are available on the GCP website. This website downloads those numbers once a minute and performs sophisticated analysis on these random numbers to see how coherent they are. That is, how probable it is that the numbers are generated as they are. The theory is that the Global Consciousness of all the people of the world affect these random numbers... Maybe they aren't quite as random as we thought.

Using the Semantic Web for Genealogy

by xibe & 1 other

(...) genealogy seems to be an obvious application of an RDF ontology and the Semantic web. I've investigated making use of RDF and the Semantic Web for Family History. The results of my investigation are here on this web page. In my work, I created a program to translate files in GEDCOM format to XML. I also wrote several stylesheets which translated the data into a new format GEDCOM XML, HTML, and RDF.

Evolution of the Web from 2000 to 2007 - average web object size quintuples since 2000

by karlcow

Summary: In a comparative survey of data traces from 2000 and 2007, University of Twente researchers found that the nature of the Web has changed from a static one-way medium to a dynamic platform for interactive services such as photo and video sharing portals.

The Web has changed dramatically over the past seven years. During that time the Web has moved from a static one-way medium toward a dynamic platform for interactive services such as photo and video sharing portals. In a comparative survey of data traces served over the Web from 2000 and 2007, University of Twente researchers found that the nature of web sites has changed (Sadre and Haverkort, 2008).

Data Visualization: Modern Approaches | Graphics | Smashing Magazine

by Xavier Lacot & 11 others
Smashing magazine describes several approaches to data visualization. Neat and powerful ideas are presented in the article.

2008

datatainment : play data ! play !

by CharlesNepote

datatainment, is a portmanteau word combining “data” and (enter) “tainment” which refers to a new way of representing digital data related to the activities of individuals within the information world.

protobuf - Google Code

by kuruzman & 3 others
Protocol Buffers are a way of encoding structured data in an efficient yet extensible format. Aka JSON.

It's all about you - Carnets de La Grange

by greut & 3 others, 3 comments

Le Web 2.0 a fonctionné sur le thème du « It's all about you » qui se résumait en fait à « It's all about your data for our advertisement incomes. »

très très justement décrit. Tant de considérer les services 2.0 comme périphériques et plus centraux et d'avoir un serveur perso par la même.

PUBLIC TAGS related to tag data

analytics +   google +   statistics +   stats +   web +  

Active users

karlcow
last mark : 14/02/2012 14:19

Monique
last mark : 15/01/2011 13:52

ghis
last mark : 01/11/2010 14:39

innipukinn
last mark : 18/10/2010 15:54

simo
last mark : 26/09/2010 13:58

topdos
last mark : 14/05/2010 16:32

webs
last mark : 12/01/2010 10:00

nvukosav
last mark : 19/11/2009 11:56

moyamada
last mark : 23/10/2009 22:11

4004
last mark : 12/09/2009 21:14

ycc2106
last mark : 21/04/2009 06:47

xibe
last mark : 14/03/2009 12:44

Xavier Lacot
last mark : 18/01/2009 19:54

rwatuny
last mark : 09/12/2008 18:25

CharlesNepote
last mark : 07/11/2008 15:37

kuruzman
last mark : 17/10/2008 21:56

greut
last mark : 14/10/2008 10:51

gardenclogs
last mark : 26/04/2008 00:15