audio2text

A Docker-based pocketsphinx setup that takes an A/V input file, transcribes its audio to text, and extracts keywords and entities from the transcript.

Once SETUP is done, you can run speech-to-text and text analysis on an A/V file locally on a Mac, with the network disabled and no cloud services involved.

PREREQUISITES

git (brew or Xcode setups have you covered ;-)

SETUP

git clone https://github.com/traceypooh/audio2text.git
cd audio2text
docker build -t audio2text .

( docker run --rm -i audio2text |tar xf - ) < test.mp3

This creates the following files in the current directory (a sample of each, generated from test.mp3, is in the repository):

out.json - detailed word/phrase with timings
out.txt - transcript of entire audio/video file
out.srt - timed transcript of audio/video file
out.key - keywords extracted from .txt (above)
out.plo - Persons, Locations, Organizations (and more) extracted from .txt (above)
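
To keep results from several input files apart, the run command above can be wrapped in a small shell function that unpacks the tar stream into a directory of your choice. This is a minimal sketch, not part of the repository: the function name transcribe_to_dir and the DOCKER override variable are hypothetical, and it assumes the image was built as audio2text as shown in the build step.

```shell
# Hypothetical helper: transcribe_to_dir INPUT OUTDIR
# Runs the same pipeline as the README command, but unpacks the
# out.* files into OUTDIR instead of the current directory.
# DOCKER is an assumed override hook; it defaults to "docker".
transcribe_to_dir() {
  input=$1
  outdir=$2
  # Fail early if the input file is missing or unreadable.
  [ -r "$input" ] || { echo "cannot read: $input" >&2; return 1; }
  mkdir -p "$outdir" || return 1
  # The container reads the A/V file on stdin and writes a tar stream on stdout.
  "${DOCKER:-docker}" run --rm -i audio2text < "$input" | tar xf - -C "$outdir"
}
```

For example, `transcribe_to_dir test.mp3 results/test` would leave out.json, out.txt, out.srt, out.key and out.plo under results/test.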

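
Because out.srt is a timed transcript, it can be searched to find where in the recording a word was spoken. The sketch below assumes out.srt uses the standard SubRip layout (a cue number, a timing line, one or more text lines, then a blank line) with Unix line endings; the function name srt_find is hypothetical and not part of the repository.

```shell
# Hypothetical helper: srt_find WORD FILE
# Prints the timing line and text of every cue whose text contains WORD
# (case-insensitive substring match).
srt_find() {
  awk -v word="$1" '
    BEGIN { RS = ""; FS = "\n" }   # one record per blank-line-separated cue
    {
      # Field 1 is the cue number, field 2 the timing line, the rest is text.
      text = ""
      for (i = 3; i <= NF; i++) text = text (i > 3 ? " " : "") $i
      if (index(tolower(text), tolower(word))) print $2 "\t" text
    }' "$2"
}
```

For example, `srt_find president out.srt` would list each matching cue as its timing line, a tab, and the cue text.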