Transcribing Audio with Indexify¶

Join Discord if you need help + ⭐ Star us on Github ⭐

In this notebook, we will use an Indexify Extractor (Whisper ASR) to transcribe audio files to texts.

Setup¶

In [ ]:

Copied!

%pip install pandas

# Download Indexify Server
!curl https://getindexify.ai | sh

# Download Extractors
!indexify-extractor download tensorlake/whisper-asr
%pip install pandas

# Download Indexify Server
!curl https://getindexify.ai | sh

# Download Extractors
!indexify-extractor download tensorlake/whisper-asr

After installing the necessary libraries, download the server, and the extractors, you need to restart the runtime. Then, you have to run Indexify Server with the Extractors.

Open 2 terminals and run the following commands:

# Terminal 1
./indexify server -d

# Terminal 2
indexify-extractor join-server

Transcribing Audio¶

In [1]:

Copied!

from indexify import IndexifyClient
client = IndexifyClient()
from indexify import IndexifyClient
client = IndexifyClient()

In [ ]:

Copied!





extraction_graph_spec = """
name: "audio"
extraction_policies:
  - extractor: "tensorlake/whisper-asr"
    name: "transcription"
"""

extraction_graph = ExtractionGraph.from_yaml(extraction_graph_spec)
client.create_extraction_graph(extraction_graph)
extraction_graph_spec = """
name: "audio"
extraction_policies:
  - extractor: "tensorlake/whisper-asr"
    name: "transcription"
"""

extraction_graph = ExtractionGraph.from_yaml(extraction_graph_spec)
client.create_extraction_graph(extraction_graph) 

In [ ]:

Copied!





# Change the path to the audio file you want to upload.
PATH = ""
content_id = client.upload_file("audio", path=PATH)
client.wait_for_extraction(content_id)
# Change the path to the audio file you want to upload.
PATH = ""
content_id = client.upload_file("audio", path=PATH)
client.wait_for_extraction(content_id)