notes
Papers
Transformers
2017-Attention is all you need
2020-An image is worth 16x16 words: Transformers for image recognition at scale
Programming
1971-Program development by stepwise refinement
Diffusion
Blogs
2020-Denoising diffusion probabilistic models
2022-Denoising diffusion implicit models
books
tts
2024-Voicebox: Text-guided multilingual universal speech generation at scale
2024-E2 TTS: Embarrassingly easy fully non-autoregressive zero-shot TTS