Skip to content

Term Archive

AI

Entries connected to "AI", gathered in one place for quick browsing.

Entries 5
Categories 25
Tags 203

Collection filter

Refine this page

Narrow the entries already loaded here by title, summary, category, or tag.

5 entries shown

OpenAI Whisper Speech Recognition Guide

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification. GitHub Repository Installation pip install git+https://github.com/openai/whisper.git Fix CUDA not detecting GPU Whisper will default to the CPU if a GPU is not detected, which is considerably slower. pip uninstall torch pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu116 Example usage # Transcribe whisper input.mp3 --model medium.en --language en --task transcribe # Translate whisper japanese.wav --model large --language Japanese --task translate Available models and languages There are five model sizes, four with English-only versions, offering speed and accuracy tradeoffs. Below are the names of the available models and their approximate memory requirements and relative speed.

OpenAI Whisper Speech Recognition Guide

MiDaS Depth Estimation Guide

GitHub Repository During installation, I ran into an issue where the CUDA package wasn’t found. Had to modify environment.yaml to: name: midas-py310 channels: - pytorch - defaults dependencies: - nvidia::cuda-toolkit=11.7.0 - python=3.10.8 - pytorch::pytorch=1.13.0 - torchvision=0.14.0 - pip=22.3.1 - numpy=1.23.4 - pip: - opencv-python==4.6.0.66 - imutils==0.5.4 - timm==0.6.12 - einops==0.6.0 Commands that were helpful for troubleshooting CUDA:

MiDaS Depth Estimation Guide

Recently Created

Latest entries created

The 10 most recently created entries, sorted by each entry's publish date.

  1. Window Monitoring with Machine Vision Mar 25, 2024
  2. 3D Photo Inpainting with Python and PyTorch May 12, 2023
  3. OpenAI Whisper Speech Recognition Guide Feb 24, 2023
  4. MiDaS Depth Estimation Guide Feb 6, 2023
  5. Stable Diffusion Scripts Feb 5, 2023

Recently Updated

Latest revisions across entries

The 10 most recently updated entries, sorted by each entry's last modification date.

  1. Window Monitoring with Machine Vision Jan 15, 2026
  2. Stable Diffusion Scripts Jan 15, 2026
  3. 3D Photo Inpainting with Python and PyTorch Dec 12, 2025
  4. OpenAI Whisper Speech Recognition Guide Dec 12, 2025
  5. MiDaS Depth Estimation Guide Dec 12, 2025

Relationship Map

Memory Field

Use the relationship map to follow related categories, tags, and entries beyond the current list.

Categories 0
Tags 0
Entries 0