Setup¶
In [ ]:
Copied!
%pip install pandas
# Download Indexify Server
!curl https://getindexify.ai | sh
# Download Extractors
!indexify-extractor download tensorlake/whisper-asr
%pip install pandas
# Download Indexify Server
!curl https://getindexify.ai | sh
# Download Extractors
!indexify-extractor download tensorlake/whisper-asr
After installing the necessary libraries, download the server, and the extractors, you need to restart the runtime. Then, you have to run Indexify Server with the Extractors.
Open 2 terminals and run the following commands:
# Terminal 1
./indexify server -d
# Terminal 2
indexify-extractor join-server
Transcribing Audio¶
In [1]:
Copied!
from indexify import IndexifyClient
client = IndexifyClient()
from indexify import IndexifyClient
client = IndexifyClient()
In [ ]:
Copied!
extraction_graph_spec = """
name: "audio"
extraction_policies:
- extractor: "tensorlake/whisper-asr"
name: "transcription"
"""
extraction_graph = ExtractionGraph.from_yaml(extraction_graph_spec)
client.create_extraction_graph(extraction_graph)
extraction_graph_spec = """
name: "audio"
extraction_policies:
- extractor: "tensorlake/whisper-asr"
name: "transcription"
"""
extraction_graph = ExtractionGraph.from_yaml(extraction_graph_spec)
client.create_extraction_graph(extraction_graph)
In [ ]:
Copied!
# Change the path to the audio file you want to upload.
PATH = ""
content_id = client.upload_file("audio", path=PATH)
client.wait_for_extraction(content_id)
# Change the path to the audio file you want to upload.
PATH = ""
content_id = client.upload_file("audio", path=PATH)
client.wait_for_extraction(content_id)