Conversation
* Minor syntax tweaks to make the code Python 3 compatible * Fixes for various NumPy warnings/errors, either due to use of "float" where "int" is required, or domain errors on log functions * Replaced the use of the obsolete Python-2-only scikits.talkbox library with a compatible LPC implementation from the Conch project * Documentation update to indicate that an old version of "rnn" is required * Invoke Lua scripts via "luajit" directly, instead of going through the "th" frontend (to reduce the dependency footprint)
|
(Pushed again to fix a minor goof in the README.) This PR should address issues #3 (in part), #5, and #7. I would appreciate an especially careful review of my changes to the More work is needed, of course. First, the tracking model needs to be rebuilt using a current version of I tested my changes with two speech files; one was 19 seconds long, the other 67 seconds. I ran DeepFormants on a multiprocessor system (Intel Xeon, no GPU) with 48 GB RAM. In both cases, the feature-extraction stage took a while to run, presumably due to the pure-Python replacement LPC implementation. No big deal. But the second stage, when Torch is invoked... the small file led to a peak memory usage of 33 GB. It didn't take particularly long, which makes me suspect all that memory was allocated but hardly used. With the large file, the usage got up to 50 GB, and once it was clear that swapping was slowing the program down to a crawl, I terminated the run. |
Minor syntax tweaks to make the code Python 3 compatible
Fixes for various NumPy warnings/errors, either due to use of
floatwhereintis required, or domain errors on log functionsReplaced the use of the obsolete Python-2-only
scikits.talkboxlibrary with a compatible LPC implementation from the Conch projectDocumentation update to indicate that an old version of
rnnis requiredInvoke Lua scripts via
luajitdirectly, instead of going through thethfrontend (to reduce the dependency footprint)