Typst is a good LaTeX alternative: Markdown-like syntax with fast rendering. It is mostly useful for researchers who currently use LaTeX, but publishers and journals rarely accept Typst.
python asks it to use uv.
uv).
Unitextify,
ConvertCase, and
LingoJam.
In comparison, WhisperX (with GPU) was much slower, had slightly poorer diarization, and slightly better transcription.
uvx --python 3.9 --index https://download.pytorch.org/whl/cu121 whisperx --diarize --lang en --hf_token $HUGGINGFACE_TOKEN
<cough>..., <laugh>..., etc. But at ~$15/hr of output, it's too expensive. Ref
[inaudible].
unoti/voice-embeddings,
retkowsky/audio_embeddings,
pyannote/embedding (for speaker similarity),
and more.
whisper-faster.exe video.mkv --language=English --model=medium generates the transcript.
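
On the voice-embedding tools listed above: the usual way to use such embeddings for speaker similarity is to compare two vectors with cosine similarity. Here is a minimal sketch of that comparison only. It does not call pyannote or any of the listed repos; the three short vectors are made-up stand-ins for real embeddings, and the 0.7 threshold and the `same_speaker` helper are hypothetical and would need tuning on real audio.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def same_speaker(a, b, threshold=0.7):
    """Hypothetical helper: treat two embeddings as one speaker if similar enough.

    The threshold is an assumption, not a value from any library.
    """
    return cosine_similarity(a, b) >= threshold


# Toy vectors standing in for real voice embeddings (assumed data).
alice_1 = [0.9, 0.1, 0.3]
alice_2 = [0.8, 0.2, 0.35]
bob = [0.1, 0.9, -0.2]

print("alice_1 vs alice_2:", same_speaker(alice_1, alice_2))
print("alice_1 vs bob:", same_speaker(alice_1, bob))
```

Real embeddings have hundreds of dimensions, but the comparison is the same. Only the source of the vectors changes.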