talkbank

Moved much of this to Drafts instead

phonbank: poor articulation
disfluent kids
late talkers
Write a review about ASR benchmark methods
- REV would be our benchmark
  - What corpora we use?
  - Has anyone used disordered speech?
  - Or really seriously accented speech vis-à-vis CORAAL (how was CORAAL sampled?)
- What samples? How do we sample? What are the benchmarks?

ASR model + WER
tildes and noprompt swapped
WER
missing words
correct alignment
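
A minimal sketch of the WER computation, for reference while reviewing benchmark methods. It assumes whitespace tokenization and no text normalization (case, punctuation, and CHAT symbols would need handling first); the function name `wer` is just illustrative, not anything from batchalign.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words to match an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

Missing words show up as deletions here, so "missing words" and WER are not independent measures.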

things

swap noprompt backwards
apostrophes for quotes
the word separation error
- put tilde BETWEEN specific symbols with connection symbols
jemoka becomes batchalign 2
Extended UD?

combining

bash script to run batchalign multiple times throughout the directories
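
The same loop sketched in Python instead of bash. This only assumes that each directory containing media should get one run; the command itself is passed in as a template, because the exact batchalign invocation is not recorded in these notes. `find_media_dirs`, `run_per_directory`, and the `{dir}` placeholder are hypothetical names for illustration.

```python
import subprocess
from pathlib import Path


def find_media_dirs(root, suffixes=(".wav", ".mp3", ".mp4")):
    """Return every directory under root that directly contains a media file."""
    root = Path(root)
    dirs = {
        p.parent
        for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in suffixes
    }
    return sorted(dirs)


def run_per_directory(root, command_template, dry_run=True):
    """Build (and optionally run) one command per media directory.

    command_template is a list of arguments; "{dir}" is replaced by the directory.
    """
    commands = []
    for directory in find_media_dirs(root):
        cmd = [part.format(dir=str(directory)) for part in command_template]
        commands.append(cmd)
        if not dry_run:
            # check=True stops at the first failing directory
            subprocess.run(cmd, check=True)
    return commands
```

With `dry_run=True` this just returns the commands it would run, e.g. `run_per_directory("corpus", ["echo", "{dir}"])`, which is a cheap way to check the directory walk before committing to a long run.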

Removing

removing non-auditory SBCA corpus area

Diarization

Diarization as a By-product of ASR

humans at the end

do speaker ID in the end
DO TO BATCHALIGN
allow people to reject files

runhouse meeting

<>Donny Greenberg: ADNE, nurses’ health
implementation at Google: grantees of canniniminty

Remaining questions

but we can’t provide SSH
function.save()
remote
- running through hashicorp vault?
- serializing ssh key remote?
- RUNHOUSE call into remote!
- headscale
take wav2vec and HuBERT and GSLM

questions?

ask about inter-turn pauses, where
- INV: something something something <-
- PAR: WWW <-
- INV: something something else <-
- PAR: words words word
no bullets are given for PAR, so do we skip it? do we count the time for WWW all as an inter-turn pause between INV and PAR? etc.

Per Turn

Turn level analysis
Rename tier to

Silence duration?

does it include inter-utterance pauses?
within-utterance pause
- fluency, mechanistic
between-utterance pause
- pause between utterances
also: between-speaker pause!
- leaves room for the speaker to take the floor
- BETWEEN speaker pauses: “I don’t know what you are asking me”, etc.: “breakdown!”
add features: STOPPA, TRESTLE, Wang
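
A rough sketch of the between-utterance vs. between-speaker split, assuming each utterance already has start and end times in milliseconds (as in CHAT bullets). Untimed utterances (the unbulleted PAR case above) are assumed to be filtered out or resolved by the caller; that question is still open. Within-utterance pauses need word-level timing and are not covered. `Utterance` and `classify_gaps` are made-up names.

```python
from dataclasses import dataclass


@dataclass
class Utterance:
    speaker: str
    start_ms: int
    end_ms: int


def classify_gaps(utterances):
    """Label the silence between consecutive timed utterances.

    Returns (kind, previous speaker, next speaker, duration in ms) tuples.
    Overlaps and zero-length gaps are skipped.
    """
    gaps = []
    for prev, nxt in zip(utterances, utterances[1:]):
        duration = nxt.start_ms - prev.end_ms
        if duration <= 0:
            continue  # overlapping or back-to-back speech: no pause
        if prev.speaker == nxt.speaker:
            kind = "between-utterance"
        else:
            kind = "between-speaker"
        gaps.append((kind, prev.speaker, nxt.speaker, duration))
    return gaps
```

Keeping both speakers in the tuple leaves room to ask later who is yielding the floor to whom.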

https://coryshain.github.io/

featurize

saturnino
fausa

Questions

What features?
Where to put them?
TalkBankDB
How to encode the features?

“How informative are your features”

Start coming up with features (TRESTLE, perhaps)
Encode them into xarray
<> saturnino

stuff

make Spanish names list
name, city, countries

corpuses

SBCSAE: Santa Barbara English
CABNC: British English

ignore any words that go wrong in the pipeline
~change: noun => n; verb => v, etc.~
DET: ignore “DEF”, or perhaps the entire featureless
unbulleted VAD experiments

errors!

line 1492

*PAR: so ‡ anyway I tiptoe to the front door , open the front door and walk in . •1045194_1050644•
%mor: co|so beg|beg adv|anyway pro:sub|I v|+n|tip+n|toe prep|to det:art|the n|front n|door cm|cm adj|open det:art|the n|front n|door coord|and n|walk adv|in .
%gra: 1|0|BEG 2|1|BEGP 3|5|JCT 4|5|SUBJ 5|0|ROOT 6|5|JCT 7|9|DET 8|9|MOD 9|6|POBJ 10|5|LP 11|14|MOD 12|14|DET 13|14|MOD 14|5|OBJ 15|14|CONJ 16|15|COORD 17|16|NJCT 18|5|PUNCT
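
For checking lines like this one, a small sketch that splits a %gra tier into (index, head, relation) triples. It assumes every entry has the form `index|head|relation` separated by whitespace, with head 0 marking an item attached to no other word. `parse_gra` and `unattached` are illustrative names only.

```python
def parse_gra(tier: str):
    """Parse a %gra tier into a list of (index, head, relation) tuples."""
    body = tier.split(":", 1)[1] if tier.startswith("%gra:") else tier
    relations = []
    for token in body.split():
        index, head, relation = token.split("|")
        relations.append((int(index), int(head), relation))
    return relations


def unattached(relations):
    """Indices whose head is 0, i.e. items not attached to another word."""
    return [index for index, head, _ in relations if head == 0]
```

Running `unattached` over the %gra tier above would list both the `BEG` item and the `ROOT`, which is a quick way to see how the parse is anchored before looking at individual tags.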

errors?

words without features need to be correctly handled (done in the middle of meeting)
04111 (me ma SOS)
nouns shouldn’t mark if it is Com,Neut, shouldn’t mark if it’s Com
fix PASTP => PAST
and do past participles exist?

Move shua to d(e)
Include instructions on how to recreate a broken Conda environment
Update the package to conda somehow

move

next steps

deal with `n`
+… fix
remove bullets

results

~ contraction
& fused
- suffix
getting rid of punkt in mor
- , => cm
- . => no PUNKT, stays

stuff

Chocolatey (noadmin, https://docs.chocolatey.org/en-us/choco/setup#non-administrative-install)
miniconda
setx path "%path%;C:\tools\miniconda3\condabin"
curl env first, then install (Windows can’t do it from a URL)

readme

~~conda init zsh (close shell, open again)~~
.mp4
mfa model downloading
what’s the difference between online docker install and manual install
NLTK Huggingface transformers tokenizers (versioning)
/opt/homebrew/Caskroom/miniforge/base/envs/aligner/lib/python3.9/site-packages/montreal_forced_aligner/corpus/text_corpus.py; getattr(self, k).update(error_dict[k])

AttributeError: 'list' object has no attribute 'update'
FileArgumentNotFoundError: ; line 139

DBA

See the data on the frequency of hapax legomena vs. COCA
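
A quick sketch of how the hapax count could be computed for a token list before comparing against COCA. It assumes tokens are already extracted and that case-folding is the only normalization wanted; `hapax_legomena` and `hapax_ratio` are illustrative names.

```python
from collections import Counter


def hapax_legomena(tokens):
    """Return the sorted word types that occur exactly once (case-insensitive)."""
    counts = Counter(token.lower() for token in tokens)
    return sorted(word for word, count in counts.items() if count == 1)


def hapax_ratio(tokens):
    """Share of word types that are hapax legomena; 0.0 for empty input."""
    counts = Counter(token.lower() for token in tokens)
    if not counts:
        return 0.0
    return len(hapax_legomena(tokens)) / len(counts)
```

The ratio here is over types, not tokens; comparing against a corpus of very different size would need some control for length.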

ESPNet

need to talk to Ji Yang

Andrew’s Features

Collapse two PAR tiers down
~~Checkpoint per file~~
~~One corpus prompt per run~~
~~Handle empty tiers~~
~~I/P selection crashes! contingency~~
~~preview the LONGEST segment instead of the top one~~
~~-i kill in the middle~~

fixes

“my mom’s cryin(g)” [<] mm [l648] (also themmm after)
“made her a nice dress” [<] mhm [l1086]
“when I was a kid I” &=laughs [l1278]

Others

chstring (for uh, mm-hmm)
retrace (asr&fa folder)
lowcase (caps)
rep-join.cut (fixes/)

numbers
<affirmative>
‘mo data!
- CallFriend/CallHome (ca-data)
- ISL?
- SBCSAE
- Aphasia + MICASE
- TBI data
Providing a Two-Pass Solution
Writing
- Big description of the pipeline
- Notion of the pipeline
- Better tokenization?
8/18

Initial segment repetition
Extracting stuttering
Grammatically problematic

mar

mar has done a thing and it’s phoneme level
We did it, now automated
LEAP data

next actions

Aphasia (-apraxia?): classification
Child data (EllisWeismer)
Dementia

a

~Multiple @Begin/CHECK problem~
~Placement of @Options~
~Strange, missing period~
~Bracket comments should FOLLOW words instead of PRECEDING them~
~%xwor: line~
STICK TO DASHES WHEN DISTRIBUTING BATCHALIGN
end the utterance when it ends (incl. inter-utterance pauses)
“I” needs to be capitalized
11005 (LT)

Align EllisWeismer

Also cool to align:

fluency IISRP/*
https://en.wikipedia.org/wiki/Speaker_diarisation
https://universaldependencies.org/

Alzheimer’s Project

https://dementia.talkbank.org/
https://luzs.gitlab.io/adresso-2021/
Specifically: https://dementia.talkbank.org/access/English/Pitt.html
Review Kathleen Fraser: https://drive.google.com/drive/u/1/folders/1lYTIzzXLXw3LlDG9ZQ7k4RayDiP6eLs1
Here are the review papers: https://drive.google.com/drive/u/1/folders/1pokU75aKt6vNdeSMpc-HfN9fkLvRyutt
Read this first: https://drive.google.com/drive/u/1/folders/0B3XZtiQwQW4XMnlFN0ZGUndUamM?resourcekey=0-AlOCZb4q9TyG4KpaMQpeoA
Some PITT data have 3-4 recordings

The best way to diagnose Alzheimer’s is from language.

Why this field is needed: to analyze a pre-post test metric.

Desired output: existence of dementia (a.k.a. Alzheimer’s).
