Re: Speech to text
- From: agalkin@xxxxxxxxxxx
- Date: 13 Jun 2006 15:05:24 -0700
Thank you for your response. Actually what I want is quite simple. I
want to take audio and create a table of time stamps showing where words
start. For instance, the first word starts at 250 ms, the second word at
700 ms, the third word at 1.5 s, etc. I do have access to both the audio
and the corresponding text.
tony.nospam@xxxxxxxxxxxxxxxxxxxxxxx wrote:
agalkin@xxxxxxxxxxx writes:
Is there any software, preferably shareware, that can find the time
offsets of words in speech audio?
The quick answer is: Yes!
The slower answer has some questions for you:
Do you know what was spoken?
How long is your utterance?
How good are your acoustic conditions?
Do you want off-the-shelf or are you prepared to code?
If you know what was spoken then various companies (inc mine) can supply
you with a solution that is accurate to within perceptual discrimination
(some tens of milliseconds). Given good hacking abilities you can now
pretty much do it yourself with toolkits such as HTK
(htk.cam.ac.uk). I'm pleased to have been responsible for the first
automated subtitles broadcast in the UK - which is a similar task
although it involved a lot more than straight speech to text alignment.
If you don't know what is spoken then you can't get every word right,
but it's still doable. Prof. Steve Renals and I ran a conference
in Cambridge called "Accessing information in spoken audio"
(http://svr-www.eng.cam.ac.uk/~ajr/esca99.html) back in 1999 which
provided much of the basics.
So, if your audio is good it's pretty much a question of how much you
want to spend on software. As you say that you'd prefer freeware then
I'd definitely recommend HTK - it won't cost you anything in software
licenses to download, train up your system and produce speech to text
alignments - although it will take significant expenditure in time.
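To give a feel for the last step, here is a minimal Python sketch that turns a word-level alignment into the kind of start-time table asked for above. It assumes the aligner has written HTK-style label lines of the form "start end word", with times in HTK's 100 ns units. The function name, the sample words and the choice to skip "sil" and "sp" entries are illustrative assumptions, not something HTK dictates.

```python
# HTK label files give times in units of 100 ns, so 10,000 units = 1 ms.
HTK_UNITS_PER_MS = 10_000

# Assumed names for silence / short-pause labels; adjust to your own setup.
NON_WORD_LABELS = {"sil", "sp"}


def word_start_times_ms(label_text):
    """Return [(word, start_ms), ...] from lines shaped like 'start end word'."""
    table = []
    for line in label_text.splitlines():
        fields = line.split()
        # Skip blank lines, headers and anything not shaped like 'start end label'.
        if len(fields) < 3 or not (fields[0].isdigit() and fields[1].isdigit()):
            continue
        start, word = int(fields[0]), fields[2]
        if word in NON_WORD_LABELS:
            continue
        table.append((word, start / HTK_UNITS_PER_MS))
    return table


# Hypothetical alignment matching the 250 ms / 700 ms / 1.5 s example.
example = """\
0 2500000 sil
2500000 7000000 the
7000000 15000000 quick
15000000 21000000 fox
"""

for word, start_ms in word_start_times_ms(example):
    print(f"{start_ms:8.1f} ms  {word}")
```

If your alignment is at phone level rather than word level, the same idea applies, but you would need to keep only the first phone of each word.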
Tony
(ob ad: CxO Cantab Research: see http://www.cantabResearch.com)