<?xml version="1.0" encoding="windows-1251"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<atom:link href="https://uakaldiinstructional.mybb.rocks/export.php?type=rss" rel="self" type="application/rss+xml" />
		<title>UA - kaldi instructional</title>
		<link>https://uakaldiinstructional.mybb.rocks/</link>
		<description>UA - kaldi instructional</description>
		<language>ru-ru</language>
		<lastBuildDate>Fri, 30 Mar 2018 22:24:00 +0300</lastBuildDate>
		<generator>MyBB/mybb.ru</generator>
		<item>
			<title>Submission: Week 10</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=51#p51</link>
			<description>&lt;p&gt;&lt;a href=&quot;https://drive.google.com/file/d/1xaK3EIzwvZr92CyqaEK9dDsYlvfUdnix/view?usp=sharing&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://drive.google.com/file/d/1xaK3EI &amp;#8230; sp=sharing&lt;/a&gt;&lt;/p&gt;</description>
			<author>mybb@mybb.ru (CathyWalsh)</author>
			<pubDate>Fri, 30 Mar 2018 22:24:00 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=51#p51</guid>
		</item>
		<item>
			<title>Submission: Week 8</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=44#p44</link>
			<description>&lt;p&gt;Masha&#039;s submission:&lt;/p&gt;
						&lt;p&gt;&lt;a href=&quot;https://drive.google.com/file/d/1oqtpLzccfyiHTVHFmCPU2OM3MfYNbjm3/view&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://drive.google.com/file/d/1oqtpLz &amp;#8230; Nbjm3/view&lt;/a&gt;&lt;/p&gt;</description>
			<author>mybb@mybb.ru (zupon)</author>
			<pubDate>Fri, 16 Mar 2018 21:57:49 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=44#p44</guid>
		</item>
		<item>
			<title>Week 9 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=39#p39</link>
			<description>&lt;p&gt;0. Questions/Concerns&lt;br /&gt;1. Look ahead to final week and 10.HW: create your own experiment&amp;#160; &amp;#160; &amp;#160; &amp;#160; &lt;br /&gt;2. Decoding at 50k feet&lt;br /&gt;3. Comparing results&lt;br /&gt;4. Lattices&lt;br /&gt;5. decoding hyperparameters&lt;br /&gt;&amp;#160; &amp;#160; A. `lmwt`&lt;br /&gt;&amp;#160; &amp;#160; B. `beam`&lt;br /&gt;&amp;#160; &amp;#160; C. `max_active`&lt;br /&gt;6. using `analyze_*.log`s&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Fri, 09 Mar 2018 19:17:38 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=39#p39</guid>
		</item>
		<item>
			<title>Week 8 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=38#p38</link>
			<description>&lt;p&gt;0. Questions/Concerns&lt;br /&gt;1. Update codebase for weeks 9 and 10&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160;A. `sudo git pull`&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160;- if conflicts:&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;1. `sudo git add` all files listed in the output&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;2. `sudo git commit -m &amp;quot;before pull&amp;quot;`&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;3. `sudo git pull`&lt;br /&gt;2. The **full** ASR pipeline&lt;br /&gt;3. Look ahead to final week and 10.HW: create your own experiment&amp;#160; &amp;#160; &amp;#160; &amp;#160; &lt;br /&gt;4. Review Case Studies&lt;br /&gt;5. HCLG composition&lt;br /&gt;6. Next week&lt;br /&gt;&amp;#160; &amp;#160; A. Our first WER/SER results!&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Fri, 09 Mar 2018 01:10:12 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=38#p38</guid>
		</item>
		<item>
			<title>Week 7 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=37#p37</link>
			<description>&lt;p&gt;0. Questions/Concerns&lt;br /&gt;1. Remaining Schedule&lt;br /&gt;2. Pull updates&lt;br /&gt;&amp;#160; &amp;#160; A. `cd /scratch/kaldi`&lt;br /&gt;&amp;#160; &amp;#160; B. `sudo git pull`&lt;br /&gt;3. FSTs at 50k feet&lt;br /&gt;4. setup HW template&lt;br /&gt;5. look at `fst_manipulate.py`&lt;br /&gt;6. FSTs in `kaldi`&lt;br /&gt;&amp;#160; &amp;#160; A. `HCLG` --&amp;gt; after break&lt;br /&gt;&amp;#160; &amp;#160; B. `G` --&amp;gt; today&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Thu, 01 Mar 2018 18:57:44 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=37#p37</guid>
		</item>
		<item>
			<title>Week 6 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=36#p36</link>
			<description>&lt;p&gt;0. Questions/Concerns&lt;br /&gt;1. Conserving AUs&lt;br /&gt;2. Acoustic Modeling at 50k feet&lt;br /&gt;&amp;#160; &amp;#160; A. train a decision tree to &amp;quot;state-tie&amp;quot; certain triphones together &lt;br /&gt;&amp;#160; &amp;#160; B. train an HMM for each triphone where the emission probabilities are modeled by a GMM (`pdf`)&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - typically 3-5 states&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - way of dealing with diphthongs?&lt;br /&gt;&amp;#160; &amp;#160; C. given a frame, find the most representative `pdf`&lt;br /&gt;3. Three Acoustic Models&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160;A. Monophones&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - quality of model not important&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - poor alignment, but a &amp;quot;good start&amp;quot; that will be bootstrapped&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160;B. Triphones&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - model context of triphones&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - require &amp;quot;state tying&amp;quot; because the number of triphones is too large&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - done by decision tree, asking &amp;quot;questions&amp;quot; about left and right context&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160;C. Triphones + LDA_MLLT&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- a transformation of the triphone model&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - for what purpose?&lt;br /&gt;4. Highlights of `exp` dir&lt;br /&gt;&amp;#160; &amp;#160; A. the `mdl` file&lt;br /&gt;5. Intuitions about acoustic modeling&lt;br /&gt;&amp;#160; &amp;#160; A. number of Gaussians&lt;br /&gt;&amp;#160; &amp;#160; B. number of leaves&lt;br /&gt;6. Next week&lt;br /&gt;&amp;#160; &amp;#160; A. openFST: &lt;a href=&quot;http://www.openfst.org/twiki/bin/view/FST/WebHome&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;http://www.openfst.org/twiki/bin/view/FST/WebHome&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; B. revisit our `ARPA` language model as an `FST`&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Fri, 23 Feb 2018 19:04:00 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=36#p36</guid>
		</item>
		<item>
			<title>Submission: Week 5</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=35#p35</link>
			<description>&lt;p&gt;&lt;a href=&quot;https://drive.google.com/file/d/1aeiaIWsBQCFrsGvCH6W9GZw5GcBq4yFX/view?usp=sharing&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://drive.google.com/file/d/1aeiaIW &amp;#8230; sp=sharing&lt;/a&gt;&lt;/p&gt;</description>
			<author>mybb@mybb.ru (zupon)</author>
			<pubDate>Fri, 16 Feb 2018 09:27:36 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=35#p35</guid>
		</item>
		<item>
			<title>Week 5 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=27#p27</link>
			<description>&lt;p&gt;0. Questions/Concerns&lt;br /&gt;1. Case Study Reviews&lt;br /&gt;2. Acoustic Modeling at 50k feet&lt;br /&gt;&amp;#160; &amp;#160; A. See &lt;a href=&quot;https://www.eleanorchodroff.com/tutorial/kaldi/kaldi-concept.html&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://www.eleanorchodroff.com/tutoria &amp;#8230; ncept.html&lt;/a&gt;&lt;br /&gt;3. Three hierarchical acoustic models&lt;br /&gt;&amp;#160; &amp;#160; A. Monophones&lt;br /&gt;&amp;#160; &amp;#160; B. Triphones&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; 1. decision tree&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; 2. HMM&lt;br /&gt;&amp;#160; &amp;#160; C. Triphones with LDA&lt;br /&gt;4. Next Week&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Thu, 15 Feb 2018 20:15:23 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=27#p27</guid>
		</item>
		<item>
			<title>Week 4 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=26#p26</link>
			<description>&lt;p&gt;1. Questions/Concerns&lt;br /&gt;&amp;#160; &amp;#160; A. What to do when stuck in &amp;quot;networking&amp;quot;&lt;br /&gt;&amp;#160; &amp;#160; B. What happens if you mess up entering your password &lt;br /&gt;2. Quick review of `kaldi_config.json` (leftover from last week)&lt;br /&gt;&amp;#160; &amp;#160; A. easiest way to edit?&lt;br /&gt;3. Quick review of `data` dir (leftover from last week)&lt;br /&gt;&amp;#160; &amp;#160; A. See 3.2&lt;br /&gt;4. `MFCC` 50k foot view&lt;br /&gt;&amp;#160; &amp;#160; A. See &lt;a href=&quot;http://haythamfayek.com/2016/04/21/speech-processing-for-machine-learning.html&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;http://haythamfayek.com/2016/04/21/spee &amp;#8230; rning.html&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; B. Best way to visualize?&lt;br /&gt;5. `MFCC` config file&lt;br /&gt;6. review of `mfcc` dir&lt;br /&gt;&amp;#160; &amp;#160; A. `ark` v `scp`&lt;br /&gt;7. Set up HW template&lt;br /&gt;&amp;#160; &amp;#160; A. `python` kernel, can use `shell` cell with `%%bash`&lt;br /&gt;&amp;#160; &amp;#160; B. Review of available methods&lt;br /&gt;8. examine `mfcc` vectors&lt;br /&gt;9. Next week&lt;br /&gt;&amp;#160; &amp;#160; A. 4.HW&lt;br /&gt;&amp;#160; &amp;#160; B. Week 5 ==&amp;gt; building acoustic models&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Tue, 06 Feb 2018 19:41:23 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=26#p26</guid>
		</item>
		<item>
			<title>Submission: Week 3</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=25#p25</link>
			<description>&lt;p&gt;&lt;a href=&quot;https://drive.google.com/file/d/1lyEIArnA2OR1Agc40mrCUhpitZwRACsk/view?usp=sharing&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://drive.google.com/file/d/1lyEIAr &amp;#8230; sp=sharing&lt;/a&gt;&lt;/p&gt;</description>
			<author>mybb@mybb.ru (damianiji)</author>
			<pubDate>Fri, 02 Feb 2018 22:05:35 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=25#p25</guid>
		</item>
		<item>
			<title>Week 3 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=17#p17</link>
			<description>&lt;p&gt;1. Questions/Concerns&lt;br /&gt;2. AUs&lt;br /&gt;&amp;#160; &amp;#160; a. if you don&#039;t see ~2300 AUs, click NEED MORE from your Dashboard; request 2304 AUs and explain you are part of the `kaldi` course&lt;br /&gt;3. Pruning&lt;br /&gt;&amp;#160; &amp;#160; a. What is the &amp;quot;meaning&amp;quot; of the threshold value?&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- minimum unigram probability?&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- minimum difference between probability and backoff?&lt;br /&gt;4. Review Week 2 HW&lt;br /&gt;5. Takeaways from Week 2 HW&lt;br /&gt;6. Confirming the correct config file&lt;br /&gt;7. Checking for error messages&lt;br /&gt;8. Review data dir&lt;br /&gt;&amp;#160; &amp;#160; a. Don&#039;t forget `lang_test_tg`!&lt;br /&gt;9. What to expect next week&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160;a. MFCCs: Sections 10.1 - 10.5 in Holmes and Holmes&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Tue, 30 Jan 2018 19:27:19 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=17#p17</guid>
		</item>
		<item>
			<title>Submission Process: Updated</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=14#p14</link>
			<description>&lt;p&gt;1. File -&amp;gt; Download As -&amp;gt; HTML&lt;br /&gt;2. Save the HTML file into your Arizona Google Drive&lt;br /&gt;3. Make it public (or choose the option that allows anyone with an `arizona.edu` address).&amp;#160; `view` rights are sufficient; they still allow downloading.&lt;br /&gt;4. Post the link in the Sticky Forum for that week&#039;s submission&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Fri, 26 Jan 2018 23:46:36 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=14#p14</guid>
		</item>
		<item>
			<title>Week 2 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=13#p13</link>
			<description>&lt;p&gt;0. Backup to volume&lt;br /&gt;&amp;#160; &amp;#160; a. Volume can be found in `/vol_c`&lt;br /&gt;&amp;#160; &amp;#160; b. directories to back up&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- `raw_data`, `data`, `mfcc`, `exp`&lt;br /&gt;&amp;#160; &amp;#160; c. command to use&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- `cp -r [dir_to_copy] [path_to_volume]`&lt;br /&gt;1. General Questions/Comments&lt;br /&gt;2. Big picture contribution of LM&lt;br /&gt;3. Brief summary of language modeling process: from corpus to LM&lt;br /&gt;4. Smoothing&lt;br /&gt;&amp;#160; &amp;#160; a. Why we need it&lt;br /&gt;&amp;#160; &amp;#160; b. Which approach to use&lt;br /&gt;5. IRSTLM&lt;br /&gt;&amp;#160; &amp;#160; a. manual: &lt;a href=&quot;http://hermes.fbk.eu/people/bertoldi/teaching/lab_2010-2011/img/irstlm-manual.pdf&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;http://hermes.fbk.eu/people/bertoldi/te &amp;#8230; manual.pdf&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; b. already compiled and code in `/scratch/kaldi/tools/irstlm/bin` (must view from *inside* `docker`!)&lt;br /&gt;&amp;#160; &amp;#160; c. how to use (see notebook 2.1)&lt;br /&gt;6. Next week&#039;s HW&lt;br /&gt;&amp;#160; &amp;#160; a. submission&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - `File -&amp;gt; Download As -&amp;gt; HTML`&lt;br /&gt;&amp;#160; &amp;#160; b. copy template&lt;br /&gt;7. Generating the probability of a sequence (see notebook 2.2)&lt;br /&gt;&amp;#160; &amp;#160; a. Default situation (len(sequence) &amp;lt;= size(n-gram) and n-gram in LM)&lt;br /&gt;&amp;#160; &amp;#160; b. Two special situations&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - n-gram not in LM&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - sequence is larger than n-gram&lt;br /&gt;8. &amp;quot;ate the lion&amp;quot; v. &amp;quot;ate the mouse&amp;quot; problem&lt;br /&gt;&amp;#160; &amp;#160; a. 
Why did it happen?&amp;#160; &amp;#160; &amp;#160;&lt;br /&gt;9. Impact of LM n-gram size&lt;br /&gt;10. Impact of LM size&lt;br /&gt;&amp;#160; &amp;#160; a. space&lt;br /&gt;&amp;#160; &amp;#160; b. speed&lt;br /&gt;&amp;#160; &amp;#160; c. alternative options&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - pruning (IRSTLM manual, section 5)&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- hyperparameter = ???&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - rescoring &lt;br /&gt;11. Intuitions about ARPA-style LMs&lt;br /&gt;12. n-gram LM v. RNN&lt;br /&gt;13.&amp;#160; What to expect next week&lt;br /&gt;&amp;#160; &amp;#160; a. Week 3 items&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; 1. `kaldi_config.json` usage&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; 2. Building the `data` directory&lt;br /&gt;&amp;#160; &amp;#160; b. Week 2 HW&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; 1. identifying a case study&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; 2. reviewing others&#039; case studies&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Thu, 25 Jan 2018 20:24:57 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=13#p13</guid>
		</item>
		<item>
			<title>Week 1 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=12#p12</link>
			<description>&lt;p&gt;0. Skim paper describing dataset building process: &lt;a href=&quot;https://www.danielpovey.com/files/2015_icassp_librispeech.pdf&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://www.danielpovey.com/files/2015_ &amp;#8230; speech.pdf&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; 1. Table 1&lt;br /&gt;&amp;#160; &amp;#160; 2. 2.2-2.3 Alignment&lt;br /&gt;&amp;#160; &amp;#160; 3. 2.4 Data Segmentation&lt;br /&gt;&amp;#160; &amp;#160; 4. Table 2 &lt;br /&gt;1. General Questions/Concerns&lt;br /&gt;2. Librispeech Dataset&lt;br /&gt;&amp;#160; &amp;#160; a. Librivox v. Project Gutenberg&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - &lt;a href=&quot;https://librivox.org/&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://librivox.org/&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; b. See paper here describing the build process of dataset: &lt;a href=&quot;https://www.danielpovey.com/files/2015_icassp_librispeech.pdf&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://www.danielpovey.com/files/2015_ &amp;#8230; speech.pdf&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - See 2.4 Data Segmentation&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - See Table 1&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - See Table 2&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - See 2.2-2.3 Alignment&lt;br /&gt;3. Review of &amp;quot;Data Files&amp;quot;&lt;br /&gt;&amp;#160; &amp;#160; a. splits&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - train: `dev-clean`&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - test: `test-clean`&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - language model: `3-gram.pruned.3e-7.arpa`&lt;br /&gt;&amp;#160; &amp;#160; b. 
audio &lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - 8 kHz v 16 kHz v flac&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - ~20% reduction in performance with 8kHz v 16kHz (&lt;a href=&quot;https://www.superlectures.com/odyssey2012/downloadFile?id=42&amp;amp;type=slides&amp;amp;filename=effects-of-audio-and-asr-quality-on-cepstral-and-high-level-speaker-verification-systems&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://www.superlectures.com/odyssey20 &amp;#8230; on-systems&lt;/a&gt;)&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - listen to some samples&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - male v. female&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - listen to some samples&lt;br /&gt;&amp;#160; &amp;#160; c. segmented v. unsegmented&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - split on &amp;quot;pause&amp;quot;&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - pause = silence for more than X seconds&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - silence = no signal &amp;gt; Y dB&lt;br /&gt;&amp;#160; &amp;#160; d. phones&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- see &lt;a href=&quot;http://www.speech.cs.cmu.edu/cgi-bin/cmudict&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;http://www.speech.cs.cmu.edu/cgi-bin/cmudict&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- silence phones&lt;br /&gt;&amp;#160; &amp;#160; e. lexicon&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- see &lt;a href=&quot;http://www.speech.cs.cmu.edu/cgi-bin/cmudict&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;http://www.speech.cs.cmu.edu/cgi-bin/cmudict&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- find stressed v. unstressed examples&lt;br /&gt;4. 
out-of-vocabulary (OOV)&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- see &lt;a href=&quot;http://gee.su/20QuL&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;http://www.speech.cs.cmu.edu/tools/lextool.html&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- see &lt;a href=&quot;https://github.com/sequitur-g2p/sequitur-g2p&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://github.com/sequitur-g2p/sequitur-g2p&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- (`tmux session=sequitur` on desktop for demo)&lt;br /&gt;5. What to expect next week&lt;br /&gt;&amp;#160; &amp;#160; - resources, see schedule: &lt;a href=&quot;https://docs.google.com/document/d/1pXtVbTodwWpyf9Z6F1z8O3DK6V4nZwPtFmWpKbIeyqc&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;https://docs.google.com/document/d/1pXt &amp;#8230; mWpKbIeyqc&lt;/a&gt;&lt;br /&gt;&amp;#160; &amp;#160; - 2.1&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - in shell using IRSTLM (manual in resource_files)&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - using a toy corpus&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160; - &amp;quot;real&amp;quot; language model will be built in Week 3&lt;br /&gt;&amp;#160; &amp;#160; - 2.2&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - in python&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; - &amp;quot;exploratory&amp;quot; with &amp;quot;case study&amp;quot;&lt;br /&gt;&amp;#160; &amp;#160; - 2.HW&lt;br /&gt;&amp;#160; &amp;#160; &amp;#160; &amp;#160; &amp;#160;- due before Week 3 class&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Wed, 17 Jan 2018 00:48:17 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=12#p12</guid>
		</item>
		<item>
			<title>Week 0 Agenda</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=11#p11</link>
			<description>&lt;div class=&quot;quote-box answer-box&quot;&gt;&lt;cite&gt;hammond wrote:&lt;/cite&gt;&lt;blockquote&gt;&lt;p&gt;I just ran notebook &amp;quot;0_1&amp;quot; and did run_sh with -n 4. It was a lot faster, something like 30-45 minutes.&lt;/p&gt;&lt;/blockquote&gt;&lt;/div&gt;
						&lt;p&gt;Yes!&amp;#160; Since we have 4 CPUs on our instance, you can do that.&amp;#160; Anything larger than 4 probably won&#039;t make a difference though.&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Wed, 17 Jan 2018 00:34:58 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=11#p11</guid>
		</item>
		<item>
			<title>General Info</title>
			<link>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=2#p2</link>
			<description>&lt;p&gt;&lt;strong&gt;Course Length&lt;/strong&gt;&lt;br /&gt;First meeting will be 1/12&lt;br /&gt;Final meeting will be 3/23&lt;/p&gt;
						&lt;p&gt;&lt;strong&gt;Time and Location&lt;/strong&gt;&lt;br /&gt;We will meet from 12:00 to 1:45 on Fridays in Douglas 216 (the &amp;quot;conference&amp;quot; room).&lt;/p&gt;
						&lt;p&gt;&lt;strong&gt;Schedule&lt;/strong&gt;&lt;br /&gt;The Google Doc of the proposed schedule is &lt;a href=&quot;http://gee.su/gK3UR&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
						&lt;p&gt;&lt;strong&gt;Repository&lt;/strong&gt;&lt;br /&gt;The repository we will be using is a branch (kaldi_instructional) off of the main kaldi github repository.&amp;#160; It can be found &lt;a href=&quot;http://gee.su/I4wqb&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;
						&lt;p&gt;&lt;strong&gt;Atmosphere Setup and Usage&lt;/strong&gt;&lt;br /&gt;See &lt;a href=&quot;https://github.com/michaelcapizzi/kaldi/blob/kaldi_instructional/egs/INSTRUCTIONAL/resource_files/using_atmosphere/using_atmosphere.md&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;this page&lt;/a&gt; for instructions on setting up an Atmosphere instance.&amp;#160; &lt;br /&gt;**Note**: The screenshots likely won&#039;t render when viewing on GitHub, sorry.&lt;/p&gt;
						&lt;p&gt;&lt;strong&gt;Atmosphere Allocation&lt;/strong&gt;&lt;br /&gt;We should have been given 2304 allocation units (AU).&amp;#160; If you do *not* see that much allocation on your dashboard, let Michael Capizzi know.&lt;br /&gt;That equates to a 4 CPU machine running 24 hours for 24 days. So if you use the VMs and shut them off when not in use that&#039;ll give you compute for the whole month except for 6 days.&lt;/p&gt;
						&lt;p&gt;&lt;strong&gt;Docker Image&lt;/strong&gt;&lt;br /&gt;The docker image we will be using can be found &lt;a href=&quot;https://hub.docker.com/r/mcapizzi/kaldi_instructional/&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;here&lt;/a&gt;.&amp;#160; If you are planning on using your own machine, you&#039;ll need to access this image.&amp;#160; See `Pulling from Dockerhub` in the project `README`.&lt;/p&gt;
						&lt;p&gt;&lt;strong&gt;General Usage&lt;/strong&gt;&lt;br /&gt;See &lt;a href=&quot;https://github.com/michaelcapizzi/kaldi/blob/kaldi_instructional/README.md&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot;&gt;the project README&lt;/a&gt; for instructions on using the `docker` `container` and `jupyter notebook`.&lt;/p&gt;</description>
			<author>mybb@mybb.ru (Michael Capizzi)</author>
			<pubDate>Tue, 28 Nov 2017 02:18:32 +0300</pubDate>
			<guid>https://uakaldiinstructional.mybb.rocks/viewtopic.php?pid=2#p2</guid>
		</item>
	</channel>
</rss>
