public marks

PUBLIC MARKS from parmentierf with tag "moteur de recherche"


Sphinx - Free open-source SQL full-text search engine

by 2 others (via)
How do you implement full-text search for that 10 million row table, keep up with the load, and stay relevant? Sphinx is good at those kinds of riddles.


inkdroid » Blog Archive » crawling bibliographic data

Comment l'OCLC protège son site web des crawlers (robots.txt + sitemaps) - Accueil

Sitemaps permet aux webmasters d'indiquer facilement aux moteurs de recherche les pages de leurs sites à explorer.


WorldWideScience is a global science gateway—accelerating scientific discovery and progress through a multilateral partnership to enable federated searching of national and international scientific databases.

Ecrans - Palettes d’images en un clic

Les moteurs de recherche graphique explorent et dessinent la Toile.

searchCrystal - Home

by 3 others (via)
searchCrystal lets you search and compare multiple engines in one place. It is a search visualization tool that enables you to compare, remix and share results from the best web, image, video, blog, tagging, news engines, Flickr images or RSS feeds.


L'assistant de recherche de Yahoo! Search

Le 6 décembre, Yahoo! a lancé en France son assistant de recherche qui était déjà disponible aux Etats-Unis depuis le mois d'octobre. Cette nouvelle fonctionnalité est destinée à faciliter les recherches en suggérant aux internautes des requêtes par "concepts associés".

KartooVisu -- fr

by 2 others (via)
La technologie utilisée dans Kvisu est baptisée par les concepteurs de l'outil de "cartographie surfacique" : "les mots-clés sont représentés sous forme de surfaces dont la superficie représente la fréquence d'apparition dans les résultats".

Web Searching with Advanced Commands

by 1 other
Seeking out facts, and even basic information on a topic, is relatively easy. Enter 2 or 3 relevant keywords at your favorite search engine. But going beyond the basic, or conducting investigative research, often means using advanced search commands, not to mention additional or more targeted finding tools. This article examines the first issue - using advanced search commands to manipulate or improve search results.

Ubuntu France : le moteur de recherche de la communauté Ubuntu francophone

Ubuntu France est un moteur de recherche ciblant exclusivement la communauté Ubuntu francophone. La sélection de sites utilisés par le moteur vous permet d'effectuer des recherches même imprécises mais fournissant des résultats pertinents, et en français. -- Using the Lucene Query Parser Without Lucene

Stop creating sophisticated search forms. You can use technologies like Ajax to give you the power of creating user friendly interfaces; use ideas like "suggest" or "type ahead"; and create a simpler interface so your users won't feel lost in a huge set of search options. Remember, all users want is to quickly find the information they are looking for. You may stop creating sophisticated and hard-to-maintain search forms, instead providing searches based on Lucene query syntax. You could satisfy your users with a simple search field, as Google does

The Xapian Project

by 10 others
Xapian is an Open Source Search Engine Library, released under the GPL. It's written in C , with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, and Ruby (so far!)


by 3 others (via)
Thagoo is a tagged search engine that uses popular social bookmarking sites as a reference to give reliable and popular results.

Zend Framework - Zend Search Lucene

Zend_Search_Lucene is a general purpose text search engine written entirely in PHP 5. Since it stores its index on the filesystem and does not require a database server, it can add search capabilities to almost any PHP-driven website.

Luke - Lucene Index Toolbox

by 2 others
Luke is a handy development and diagnostic tool, which accesses already existing Lucene indexes and allows you to display and modify their contents in several ways:

APML - Attention Profiling Mark-up Language

by 4 others (via)
APML will allow users to export and use their own personal Attention Profile in much the same way that OPML allows them to export their reading lists from Feed Readers. The idea is to boil down all forms of Attention Data – including Browser History, OPML, Attention.XML, Email etc – to a portable file format containing a description of ranked user interests.

Official Google Blog: Controlling how search engines access and index your website

by 1 other
This is the first of a series of posts on how to use robots.txt to control access to your content.

techXtra -

Find articles, key websites, books, the latest industry news, job announcements, ejournals, eprints, technical reports, the latest research, thesis & dissertations and more! In Engineering, Mathematics, and Computing


by 21 others (via)
Moteur de recherche qui passe d'abord par un nuage de mots-clés (et par de la co-occurrence).