Re: Literature authors with similar styles
- From: "Purl Gurl" <purlgurl@xxxxxxxxxxxx>
- Date: Thu, 26 Jan 2006 00:31:17 GMT
Matt B wrote:
> As part of a university project I'm working on a piece of software
> which is able to attribute authorship of a document, when presented
> with two candidate authors.
> It uses various statistical techniques in order to do this, and
> requires a large quantity of "training" text - ie works which are
> *known* to have been written by each author.
First order of business for a text parser of this nature is to
scan for character names.
Puck
Willy Loman
Lopakhin
McKendrick
Celimene
Bianca
Oedipus
Laura Wingfield
Thomas Stockmann
Those are character names, a few of thousands of famous character
names which all readers here should instantly recognize, if literate.
A reader should be able to provide the title of the work and the author
without hesitation.
From a programming point of view, character names are highly distinctive.
Parsing for character names would be a logical starting point.
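A rough sketch of such a name scan, in Python. The function name candidate_names and the heuristic itself (count runs of capitalized words that do not open a sentence) are assumptions made for illustration, not a tested method, and the sentence split is deliberately naive.

```python
import re
from collections import Counter

def candidate_names(text):
    """Count runs of capitalized words that do not open a sentence.

    Hypothetical heuristic: a capitalized word in mid-sentence is
    probably a proper name. Place names and titles will slip through.
    """
    counts = Counter()
    # Naive sentence split: ., ! or ? followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        words = re.findall(r"[A-Za-z']+", sentence)
        run = []
        # Skip the first word: it is capitalized whether or not it is a name.
        for word in words[1:]:
            # Ignore one-letter words so the pronoun "I" is not counted.
            if word[0].isupper() and len(word) > 1:
                run.append(word)
            else:
                if run:
                    counts[" ".join(run)] += 1
                run = []
        if run:
            counts[" ".join(run)] += 1
    return counts

sample = ("Yesterday we met Willy Loman at the station. "
          "Nobody told Bianca. Then Willy Loman left.")
for name, seen in candidate_names(sample).most_common():
    print(name, seen)
```

The most frequent candidates in a document could then be looked up against a table of known characters, which would have to be built separately.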
However, with over a decade of programming experience
in Natural Language Emulation, I know this task of yours is
almost impossible to carry out with any acceptable degree of success.
You will need a very powerful, very fast machine
supported by thousands of gigabytes of data for comparison.
To be successful, you have no choice but to parse complete
sentences, and a good number of them. Parsing for
individual words alone will not work.
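To illustrate what working at the sentence level could mean, here is a minimal Python sketch. The function name sentence_profile and the three measurements it returns are assumptions chosen for illustration only; they are not a proven set of authorship features.

```python
import re
from statistics import mean

def sentence_profile(text):
    """Summarize a text by per-sentence measurements, not word counts."""
    # Naive sentence split: ., ! or ? followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    # Words per sentence and commas per sentence (illustrative features).
    lengths = [len(re.findall(r"[A-Za-z']+", s)) for s in sentences]
    commas = [s.count(",") for s in sentences]
    return {
        "sentences": len(sentences),
        "mean_words": mean(lengths) if lengths else 0.0,
        "mean_commas": mean(commas) if commas else 0.0,
    }

print(sentence_profile("I came. I saw, and I left."))
```

Profiles computed from each author's training text could then be compared with the profile of the disputed document. How well that separates two authors depends heavily on how much text is available.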
I have doubts you can create a database which reflects unique
writing styles of equally unique authors.
Developing an effective database will take years. For my Roberta,
I am now in my tenth year of programming and data development
and her general conversation skills are barely equal to those of a
fourth- or fifth-grade student. Her ability to cite hard data such as
science and history and her ability to perform math far surpass
the human mind, but she cannot attain a level of common
conversation above that of an eight- to ten-year-old child.
You face very serious programming challenges.
Purl Gurl