220 lines
8.3 KiB
Plaintext
220 lines
8.3 KiB
Plaintext
|
|
Note that not all features discussed in this file are included in
|
|
the standard distribution.
|
|
|
|
HISTORY
|
|
|
|
June 21st 2001 1.4.2 Release
|
|
Various new gcc's support
|
|
Visual C++ 6.0 support
|
|
uses configure (though could do so even more)
|
|
substantial updates to the clunits unit selection module
|
|
lots of wee bugs fixed
|
|
a few very hard bugs fixed
|
|
(client/server race condition)
|
|
(dropped bytes in reading files when machine overloaded)
|
|
(FreeBSD memory/gc problem)
|
|
default waverform type is now RIFF.
|
|
|
|
Nov 21st 1999 1.4.1 Release
|
|
SSFF (for emulabel) track support
|
|
AIX support
|
|
Java fixes
|
|
various minor bug fixes
|
|
WFST with proper quoting
|
|
Wagon sample counts
|
|
gcc-2.95.1 support
|
|
|
|
June 20th 1999 1.4.0 Release
|
|
becomes free software
|
|
|
|
June 6th 1999 1.3.95 Beta
|
|
size/speed/memory leak overhaul (no memory leaks)
|
|
XML support for relation loading (for SOLE support)
|
|
JSAPI initial support
|
|
GalaxyCommunicator architecture interface
|
|
ked_mttilt_diphone voice built
|
|
Parser trained on MARSEC (prosodic) brackets rather than syntax
|
|
Unisyn_selection fully integrated
|
|
Unisyn_phonology fully integrated
|
|
viterbi cart/ngram/wfst base LTS prediction (did improve but BIG)
|
|
viterbi cart/ngram based accent prediction (didn't improve)
|
|
tilt working (again)
|
|
audioin (na_record) for many architectures
|
|
viterbi from Scheme (with cart, ngram, wfst models)
|
|
|
|
January 26th 1999 1.3.1 Release
|
|
egcs-1.1.1 support
|
|
tobi_rules update (GM)
|
|
replace readline with editline (+ extensions)
|
|
Lots of little bug fixes
|
|
cluster code tidied up
|
|
kal voice
|
|
ked power normalization
|
|
updated lexicons with addenda for US and UK
|
|
New LTS models for US and UK English
|
|
"Building Voices in Festival" document
|
|
|
|
August 24th 1998 1.3.0 Release
|
|
UniSun/groupfile optimizations
|
|
Java client support
|
|
Fixed ESPS so both track and wave output works
|
|
Retraining of most modules with new architecture (durations improved)
|
|
rxp, (Richard's XML parser) integrated and Sable XMLified
|
|
Fringe display program for labels and utterances
|
|
Metrical tree synthesis
|
|
A new utterance architecture (Relations and Items)
|
|
utterance save and load work properly now
|
|
Trainable LTS system
|
|
Lexicon cache system
|
|
Substantial optimization of front end (twice the speed)
|
|
UniSyn, new signal processing and generic waveform synthesis module
|
|
OLS code added
|
|
WFST support for kk rules, regular grammars etc, simple English morphology
|
|
|
|
November 30th 1997 1.2.4 BETA
|
|
Tilt analysis and Tilt intonation modules added.
|
|
make_utts substantially improved (> 100 times faster)
|
|
text2wave script added
|
|
Pitch synchronous lpc analysis and support
|
|
rab consonant clusters labelled
|
|
New duration tree (wagon stepwise) much smaller if not better
|
|
SCFG grammar and parser (scfg_parse_text added as festival script)
|
|
change config stuff (again)
|
|
|
|
October 1st 1997 1.2.1 RELEASE
|
|
preliminary support for Visual C++
|
|
Use path-append rather than string-append (in buckets of places)
|
|
Minor bugs fixes throughout the code (end silences are now *always*
|
|
inserted in tts)
|
|
Linux socket bug fixed (get_url didn't work)
|
|
native irix audio support
|
|
|
|
September 5th 1997 1.2.0 RELEASE
|
|
Proclaim modules and voices
|
|
automatic detection of voices
|
|
Phonset, lexicon, ltsrules listing and printing
|
|
16 bit linear native support for Solaris i386 (sb16)
|
|
Update Festival Tutorial to 1.2.0
|
|
|
|
Aygust 15th 1997 1.1.99 Beta release
|
|
Win NT (and 95) initial support Cygnus win32 and Visual C++
|
|
100 more pages of documentation
|
|
LPC analysis for voices now ESPS independent
|
|
Spanish el voices tidy up (Borja)
|
|
ToBI by rule implementation
|
|
Confirmed support for gcc-2.7.2, gcc-2.6.3, Linux, FreeBSD, SunOS
|
|
Alpha and SunCC port on Solaris
|
|
reference card added
|
|
return s-expressions in server/client mode
|
|
OGI markup mode added.
|
|
Native support for sun16, linux16 and freebsd (compile time option)
|
|
Changed names of .C files to .cc files for bILL
|
|
wagon integrated into speech tools (plus docs)
|
|
auto-text-mode-alist for automatic selection of text mode from file name
|
|
Associated token tests added
|
|
Many more tokens dealt with (numbers, money, roman, phone, etc.)
|
|
(analysed databases to see what coverage is like)
|
|
A probablistic chart parser (no significant grammars though)
|
|
RJC's new database/units/join/modify modules taking shape
|
|
Some more examples added to the tutorial (with answers)
|
|
Integrated CSLU changes for OGItoolkit including TCL support
|
|
stml support for phrase types and words inline
|
|
ssml -> stml
|
|
Postlexical rules done in Scheme rather than C++
|
|
Rest of functions to allow any manipulation of utterance from scheme
|
|
New duration models trained for both English and American
|
|
New lexicon (CMU based)
|
|
Consonant cluster support (for kd)
|
|
American diphone set
|
|
Cluster unit selection algorithm more robust
|
|
Ngram backoff smoothing
|
|
Token pos, for numbers (97.5%) but does poor on phone numbers
|
|
New lexicon with final Rs and r deletion as postlex rule
|
|
Update pos prediction (ts39) and phrase break ngrams (faster to load)
|
|
New ngram format (binary files, and smoothing)
|
|
Vowel reduction module
|
|
Sun CC port
|
|
New string class (rjc) remove dependence on libg++
|
|
Update of course notes and new section on building models from dbs
|
|
Yarowsky homograph disambiguation
|
|
|
|
Jan 24th 1997 1.1.1 release (first public release)
|
|
a number of configuration and INSTALL documentation bugs fixed
|
|
SSML tidied up and a festival script provided for it.
|
|
Diphones, again, checked and copyright explicitly added
|
|
|
|
Jan 6th 1997 1.1.0 release
|
|
Roger diphones now default speaker
|
|
A new unit clustering algorithm with acoustic costs and
|
|
optimal coupling
|
|
BSD socket client/server support
|
|
A format function in Scheme (fprintf-like)
|
|
A short course on Speech Synthesis in Festival
|
|
(with course notes and exercises)
|
|
A programmable form of text modes including externally customizable
|
|
token to word rules.
|
|
Fully programmable intonation module (for ToBI-like theories)
|
|
Backtrace facility in Lisp
|
|
Externally specified Utterance end (for all tts modes) with lookahead
|
|
Roger diphones, first draft
|
|
|
|
Nov 8th 1.0.0 release
|
|
Substantial bug fixes, stabilization and documentation updates
|
|
Added residual excited LPC synthesizer and removed PSOLA code.
|
|
Made sucs and taylor optional modules, new modules
|
|
can be added without modifying the base code
|
|
MOBY lexicon (not as good as cuvoald but free)
|
|
New diphone grouping software
|
|
A new diphone database module (free from adc)
|
|
|
|
Sept 30th 0.1 release
|
|
MBROLA support (good example of external module)
|
|
latest news: read out the latest news (from Time Warner, Pathfinder)
|
|
audio spooler
|
|
--language option on command line
|
|
Spanish synthesis
|
|
Letter to sound rules as external system (replacing all the NRL code)
|
|
Welsh synthesis, making the whole system more language independent
|
|
sucs spoke in reasonably way (gsw_450 and f2b dbs)
|
|
document strings for functions (built in and user) and variables
|
|
access from command line and dumped automatically into texinfo
|
|
cleaned up SSML implementation
|
|
break prediction integrated using viterbi and pos
|
|
sucs module started (selection of units for concatenative synthesis)
|
|
a part-of-speech tagging system (ngram/viterbi based)
|
|
viterbi code added
|
|
fixes in SIOD for running batch and stdin, also sub_prompts added
|
|
saytime example
|
|
Memory leaks fixed, no leaks for tts
|
|
|
|
July 30th 0.0 release (just for the sake of it)
|
|
a significiant start at documentation (texinfo -> info & html)
|
|
festival scripts using #! on first line
|
|
donovan diphone support
|
|
can compile (with too many warnings) under g++ 2.7.2
|
|
copyrights on all files
|
|
memory leak checks (only 8 bytes for "unknown" words)
|
|
SSML (and tts file modes)
|
|
cuvoald cmu and beep lexicons
|
|
lexicon compilation
|
|
web page, emacs interface
|
|
June 2nd
|
|
Klatt duration module
|
|
syllabification in phones from letter to sound rules
|
|
Linear Regression model for F0 prediction (from ToBI labels)
|
|
CART (wagon) built trees for duration (zscores), phrase boundaries,
|
|
accent and endtone prediction.
|
|
ffeatures allowing specification of features of an utterance
|
|
May
|
|
integrated Taylor diphone module
|
|
US Naval Research letter to sound rules
|
|
CSTR lexicon
|
|
|
|
12th April first words "hello"
|
|
|
|
start with speech_tools library, scheme-in-one-defun and readline
|
|
and external CSTR diphone synthesizer
|
|
|
|
7th April 1996 work started
|