Massive lexicon of word relationships

Databases & Networks Massive lexicon of word relationships 279.99 For sale: Over 4 MILLION words and phrases, in several languages, and the semantic relationships between them. Read on. Several years ago, I was doing a project which necessitated finding words that were "related to" one that the user supplied. It wasn't enough to merely import a thesaurus - For instance, if someone typed "lawyer", I didn't want "barrister, attorney, counsel" - I wanted my query to return "divorce, subpoena, accident, lawsuit," etc. Finding that there was no such data easily available even for sale, I had to create my own. Finding words was easy: there are many public-domain dictionaries out there and I merely imported them all and condensed them into one. To find the word relationships, I embarked on a 6 month project of constructing algorithms that find words that are commonly associated by frequency on small documents found on the internet, and in large public-domain literature and periodical collections. The end result is three large SQL tables. The first, "words", contains words in 6 European languages. "related" contains word associations between words using the first of my algorithms, and "associated" contains somewhat looser associations based on the second algorithm. The algorithms used are fairly simple, but the work required to sift the data required several computers running nonstop for nearly all of the 6 months, using custom-made scripts. When all the work was done, the project for which I had originally started this adventure ran out of funding. So now I'm selling to you 6 months worth of intense work for less than $300. What you get here is a Dump of the MySQL database, compressed into a ZIP. When you uncompress it, the resulting text file can be imported into any database management tool to reconstruct the three tables. I guess the thing I'm trying to express here is: this is not a silly little thesaurus. It's a serious pile of data that has had a lot of work put into it, containing word *relationships* that can't be gleaned from any other source. http://www.scubbly.com/item/43981/
Massive lexicon of word relationships
Categoría: Bases de Datos y Redes

Descripción del Producto

For sale: Over 4 MILLION words and phrases, in several languages, and the semantic relationships between them. Read on.

Several years ago, I was doing a project which necessitated finding words that were "related to" one that the user supplied. It wasn't enough to merely import a thesaurus - For instance, if someone typed "lawyer", I didn't want "barrister, attorney, counsel" - I wanted my query to return "divorce, subpoena, accident, lawsuit," etc.

Finding that there was no such data easily available even for sale, I had to create my own.

Finding words was easy: there are many public-domain dictionaries out there and I merely imported them all and condensed them into one. To find the word relationships, I embarked on a 6 month project of constructing algorithms that find words that are commonly associated by frequency on small documents found on the internet, and in large public-domain literature and periodical collections.

The end result is three large SQL tables. The first, "words", contains words in 6 European languages. "related" contains word associations between words using the first of my algorithms, and "associated" contains somewhat looser associations based on the second algorithm.

The algorithms used are fairly simple, but the work required to sift the data required several computers running nonstop for nearly all of the 6 months, using custom-made scripts.

When all the work was done, the project for which I had originally started this adventure ran out of funding. So now I'm selling to you 6 months worth of intense work for less than $300.

What you get here is a Dump of the MySQL database, compressed into a ZIP. When you uncompress it, the resulting text file can be imported into any database management tool to reconstruct the three tables.

I guess the thing I'm trying to express here is: this is not a silly little thesaurus. It's a serious pile of data that has had a lot of work put into it, containing word *relationships* that can't be gleaned from any other source.

$279.99
Aproximadamente $279.99 USD
Usted tiene este ítem en su carro de compras
eliminarlo

Gane $70.66 creando un enlace a esto.
Aprenda cómo hacerlo..

Datos del producto

Nombre de archivo: lexicon.sql.zip
Tamaño: 89MB
Añadido: 15-10-2010

Libre de virus
Última análisis: 01-11-2010

Acerca del vendedor

contacte al vendedor

Otros productos:
King of Wands
King of Wands
$6.00 en Punto de cruz Patrones
Rider-Waite Tarot - the Cups - Cross-stitch Patterns
Rider-Waite Tarot - the Cups - Cross-stitch Patterns
$49.98 en Punto de cruz Patrones

... y 78 más

Comparta Esto

Agregue esto a su lista de deseos ingeniosos
Compartir en Facebook

Enlace a este

URL:
Instant Buy URL:

enlaces de afiliados se muestran cuando se ha registrado pulg
Entra ahora.


Insertar este

Pon este código en tu sitio web para una "descarga inmediata" al igual que el botón de arriba

Más widgets



Retroalimentación

No hay comentarios para este artículo todavía.