Sunbelt Computer Software

This repository contains scripts for extracting text and speech features. Features are primarily designed to be extracted from the outputs of automatic speech recognition models (e.g. text, timing, confidence scores).

The subdirectories are as follows:

text_features contains scripts for extracting text features. They can be extracted from transcribed speech or written text.
microsoft_asr_features contains scripts for extracting features specific to the output of Microsoft's speech-to-text models. This includes a script for extracting the text features mentioned above from the output of Microsoft's speech-to-text model, as well as scripts for extracting features related to word-level timing and model confidence scores.
kaldi_asr_features contains scripts for extracting features specific to the output of Kaldi ASR models, including word- and phone-level timing features, ASR confidence scores, and non-verbal expression counts.
archived contains outdated/in-progress scripts that are not yet documented.

Note: the input files expected by the scripts within the microsoft_asr_features and kaldi_asr_features directories can be produced using the code in this repository (see the asr-models-support directory).
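
As a rough illustration of the kind of word-level timing and confidence features described above, here is a minimal sketch. It does not reproduce the scripts in this repository: the function name `timing_and_confidence_features`, the input layout (a list of dicts with `word`, `start`, `end`, and `confidence` keys), and the specific features computed are all assumptions made for the example.

```python
from statistics import mean


def timing_and_confidence_features(words):
    """Compute simple timing and confidence features from ASR word output.

    `words` is a hypothetical input format: a time-ordered list of dicts with
    'word' (str), 'start' and 'end' (seconds), and 'confidence' (0 to 1).
    """
    if not words:
        return {
            "n_words": 0,
            "speech_rate_wps": 0.0,
            "mean_pause_s": 0.0,
            "mean_confidence": 0.0,
        }

    # Time from the start of the first word to the end of the last word.
    total_time = words[-1]["end"] - words[0]["start"]

    # Silent gaps between consecutive words (overlaps are clipped to zero).
    pauses = [
        max(0.0, nxt["start"] - cur["end"])
        for cur, nxt in zip(words, words[1:])
    ]

    return {
        "n_words": len(words),
        "speech_rate_wps": len(words) / total_time if total_time > 0 else 0.0,
        "mean_pause_s": mean(pauses) if pauses else 0.0,
        "mean_confidence": mean(w["confidence"] for w in words),
    }


example = [
    {"word": "hello", "start": 0.0, "end": 0.5, "confidence": 0.9},
    {"word": "world", "start": 1.0, "end": 1.5, "confidence": 0.7},
]
features = timing_and_confidence_features(example)
```

The actual scripts expect the input files produced by the ASR support code mentioned in the note above, so the field names here would need to be mapped to whatever those files contain.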
