Custom ASR

Custom ASR enables users to integrate and test their own Automatic Speech Recognition (ASR) APIs with Datasaur projects. To enable this feature in your workspace, contact us at [email protected].

Configuring Custom ASR

Step 1: Set Up the Sample API

Open the provided CodeSandbox example API here.
- To test this code, you can also create a live endpoint using codesandbox.io.
Save any changes to the codebase to automatically create a fork for further testing and modifications.
Copy the base URL of the API from the CodeSandbox preview panel (on the right-hand side). In addition, you may need to ensure the panel is expanded if it’s collapsed.
In the sample API that we provided, you can use the following endpoints based on your needs:
- {baseUrl}/text-extraction/example-text (returns plain text transcription).
- {baseUrl}/text-extraction/example-json (returns Importable JSON transcription).

Step 2: Add Custom API in Project Creation Wizard

Go to the Active Projects page and click “Create Project".
In Step 1: Upload, upload your image/PDF (for OCR) or audio file (for ASR) and click Next.
In Step 2: Preview, click the dropdown labeled “Apply ASR Method” (or “Apply OCR Method” for images/PDFs).
Select “+ Add New API…” to open a dialog box.
- Fill in the fields:
  - Name: Desired name for the API.
  - Custom API URL: Paste one of the two endpoints from our sample API.
  - Secret: Add any placeholder value (this can be ignored for now).
Click Save to add and select the API.
Review the sample transcription on the right panel.