Datasaur
Visit our websitePricingBlogPlaygroundAPI Docs
  • Welcome to Datasaur
    • Getting started with Datasaur
  • Data Studio Projects
    • Labeling Task Types
      • Span Based
        • OCR Labeling
        • Audio Project
      • Row Based
      • Document Based
      • Bounding Box
      • Conversational
      • Mixed Labeling
      • Project Templates
        • Test Project
    • Creating a Project
      • Data Formats
      • Data Samples
      • Split Files
      • Consensus
      • Dynamic Review Capabilities
    • Pre-Labeled Project
    • Let's Get Labeling!
      • Span Based
        • Span + Line Labeling
      • Row & Document Based
      • Bounding Box Labeling
      • Conversational Labeling
      • Label Sets / Question Sets
        • Dynamic Question Set
      • Multiple Label Sets
    • Reviewing Projects
      • Review Sampling
    • Adding Documents to an Ongoing Project
    • Export Project
  • LLM Projects
    • LLM Labs Introduction
    • Sandbox
      • Direct Access LLMs
      • File Attachment
      • Conversational Prompt
    • Deployment
      • Deployment API
    • Knowledge base
      • External Object Storage
      • File Properties
    • Models
      • Amazon SageMaker JumpStart
      • Amazon Bedrock
      • Open AI
      • Azure OpenAI
      • Vertex AI
      • Custom model
      • Fine-tuning
      • LLM Comparison Table
    • Evaluation
      • Automated Evaluation
        • Multi-application evaluation
        • Custom metrics
      • Ranking (RLHF)
      • Rating
      • Performance Monitoring
    • Dataset
    • Pricing Plan
  • Workspace Management
    • Workspace
    • Role & Permission
    • Analytics
      • Inter-Annotator Agreement (IAA)
        • Cohen's Kappa Calculation
        • Krippendorff's Alpha Calculation
      • Custom Report Builder
      • Project Report
      • Evaluation Metrics
    • Activity
    • File Transformer
      • Import Transformer
      • Export Transformer
      • Upload File Transformer
      • Running File Transformer
    • Label Management
      • Label Set Management
      • Question Set Management
    • Project Management
      • Self-Assignment
        • Self-Unassign
      • Transfer Assignment Ownership
      • Reset Labeling Work
      • Mark Document as Complete
      • Project Status Workflow
        • Read-only Mode
      • Comment Feature
      • Archive Project
    • Automation
      • Action: Create Projects
  • Assisted Labeling
    • ML Assisted Labeling
      • Amazon Comprehend
      • Amazon SageMaker
      • Azure ML
      • CoreNLP NER
      • CoreNLP POS
      • Custom API
      • FewNERD
      • Google Vertex AI
      • Hugging Face
      • LLM Assisted Labeling
        • Prompt Examples
        • Custom Provider
      • LLM Labs (beta)
      • NLTK
      • Sentiment Analysis
      • spaCy
      • SparkNLP NER
      • SparkNLP POS
    • Data Programming
      • Example of Labeling Functions
      • Labeling Function Analysis
      • Inter-Annotator Agreement for Data Programming
    • Predictive Labeling
  • Assisted Review
    • Label Error Detection
  • Building Your Own Model
    • Datasaur Dinamic
      • Datasaur Dinamic with Hugging Face
      • Datasaur Dinamic with Amazon SageMaker Autopilot
  • Advanced
    • Script-Generated Question
    • Shortcuts
    • Extensions
      • Labels
      • Review
      • Document and Row Labeling
      • Bounding Box Labels
      • List of Files
      • Comments
      • Analytics
      • Dictionary
      • Search
      • Labeling Guidelines
      • Metadata
      • Grammar Checker
      • ML Assisted Labeling
      • Data Programming
      • Datasaur Dinamic
      • Predictive Labeling
      • Label Error Detection
      • LLM Sandbox
    • Tokenizers
  • Integrations
    • External Object Storage
      • AWS S3
        • With IRSA
      • Google Cloud Storage
      • Azure Blob Storage
      • Dropbox
    • SAML
      • Okta
      • Microsoft Entra ID
    • SCIM
      • Okta
      • Microsoft Entra ID
    • Webhook Notifications
      • Webhook Signature
      • Events
      • Custom Headers
    • Robosaur
      • Commands
        • Create Projects
        • Apply Project Tags
        • Export Projects
        • Generate Time Per Task Report
        • Split Document
      • Storage Options
  • API
    • Datasaur APIs
    • Credentials
    • Create Project
      • New mutation (createProject)
      • Python Script Example
    • Adding Documents
    • Labeling
      • Create Label Set
      • Add Label Sets into Existing Project
      • Get List of Label Sets in a Project
      • Add Label Set Item into Project's Label Set
      • Programmatic API Labeling
      • Inserting Span and Arrow Label into Document
    • Export Project
      • Custom Webhook
    • Get Data
      • Get List of Projects
      • Get Document Information
      • Get List of Tags
      • Get Cabinet
      • Export Team Overview
      • Check Job
    • Custom OCR
      • Importable Format
    • Custom ASR
    • Run ML-Assisted Labeling
  • Security and Compliance
    • Security and Compliance
      • 2FA
  • Compatibility & Updates
    • Common Terminology
    • Recommended Machine Specifications
    • Supported Formats
    • Supported Languages
    • Release Notes
      • Version 6
        • 6.111.0
        • 6.110.0
        • 6.109.0
        • 6.108.0
        • 6.107.0
        • 6.106.0
        • 6.105.0
        • 6.104.0
        • 6.103.0
        • 6.102.0
        • 6.101.0
        • 6.100.0
        • 6.99.0
        • 6.98.0
        • 6.97.0
        • 6.96.0
        • 6.95.0
        • 6.94.0
        • 6.93.0
        • 6.92.0
        • 6.91.0
        • 6.90.0
        • 6.89.0
        • 6.88.0
        • 6.87.0
        • 6.86.0
        • 6.85.0
        • 6.84.0
        • 6.83.0
        • 6.82.0
        • 6.81.0
        • 6.80.0
        • 6.79.0
        • 6.78.0
        • 6.77.0
        • 6.76.0
        • 6.75.0
        • 6.74.0
        • 6.73.0
        • 6.72.0
        • 6.71.0
        • 6.70.0
        • 6.69.0
        • 6.68.0
        • 6.67.0
        • 6.66.0
        • 6.65.0
        • 6.64.0
        • 6.63.0
        • 6.62.0
        • 6.61.0
        • 6.60.0
        • 6.59.0
        • 6.58.0
        • 6.57.0
        • 6.56.0
        • 6.55.0
        • 6.54.0
        • 6.53.0
        • 6.52.0
        • 6.51.0
        • 6.50.0
        • 6.49.0
        • 6.48.0
        • 6.47.0
        • 6.46.0
        • 6.45.0
        • 6.44.0
        • 6.43.0
        • 6.42.0
        • 6.41.0
        • 6.40.0
        • 6.39.0
        • 6.38.0
        • 6.37.0
        • 6.36.0
        • 6.35.0
        • 6.34.0
        • 6.33.0
        • 6.32.0
        • 6.31.0
        • 6.30.0
        • 6.29.0
        • 6.28.0
        • 6.27.0
        • 6.26.0
        • 6.25.0
        • 6.24.0
        • 6.23.0
        • 6.22.0
        • 6.21.0
        • 6.20.0
        • 6.19.0
        • 6.18.0
        • 6.17.0
        • 6.16.0
        • 6.15.0
        • 6.14.0
        • 6.13.0
        • 6.12.0
        • 6.11.0
        • 6.10.0
        • 6.9.0
        • 6.8.0
        • 6.7.0
        • 6.6.0
        • 6.5.0
        • 6.4.0
        • 6.3.0
        • 6.2.0
        • 6.1.0
        • 6.0.0
      • Version 5
        • 5.63.0
        • 5.62.0
        • 5.61.0
        • 5.60.0
  • Deployment
    • Self-Hosted
      • AWS Marketplace
        • Data Studio
        • LLM Labs
Powered by GitBook
On this page
  • Overview
  • Setup
  • Supported Azure OpenAI Models
  • How to Integrate Azure Open AI Using Certificate
  • Disconnecting Azure OpenAI
  • Availability
  • Conclusion
  1. LLM Projects
  2. Models

Azure OpenAI

Last updated 2 days ago

Overview

The Azure OpenAI integration feature in Datasaur's LLM Labs allows you to seamlessly integrate your own Large Language Models (LLMs) into the platform. This functionality provides three key benefits:

  1. Integrating your own LLMs: This feature enables you to bring your own models into LLM Labs. Datasaur will retrieve all available Azure OpenAI models based on your credentials. Once connected, these models will be accessible to all workspace members.

  2. Protect your data: By using your own LLMs, you can ensure better security compliance, providing you with complete control and privacy over your data.

  3. Streamlined workflow: Access your preferred models directly within Datasaur's user-friendly interface, simplifying your LLM workflows to allows you to gain insights about your model's performance by comparing it to other models, both within and outside of Azure.

Setup

  1. Navigate to the Models page under LLM Labs menu.

  2. Open My models tab, and click the Manage providers button.

  1. Choose Azure OpenAI as the provider, and insert your Tenant ID, Subscription ID, Resource group name, and Account name.

  1. If this is your first time integrating Azure Open AI with Datasaur, click the Add Datasaur to your Azure Tenant button to grant consent for Datasaur's App Registration to your tenant. If you have previously done this, you can skip this step.

    • You must have at least the Privileged Role Administrator role to grant consent to App Registrations for delegated access. Continue the process on a new tab and authorize the Datasaur app. Note that you need Admin permission to complete this action.

  1. Grant role assignments to Datasaur's Service Principal

    1. Still in the Azure OpenAI Account, find Access Control (IAM) on the sidebar.

    2. Click on Add > Add role assignment.

    3. Select Cognitive Services OpenAI Contributor, then click Next.

    4. In Assign access to, click on User, group, or service principal, then click select members.

    5. Search for Datasaur Azure AI Integrator (you cannot see the service principals from the list, you must search for them), then add it. This step will not be available if you haven't added Datasaur to your Azure tenant as described above.

    6. Proceed by clicking "Next" and then click "Next" once more when you reach the Conditions section.

    7. Click on Review + assign.

  2. Getting the Tenant ID:

    1. Under the Overview tab, copy the Tenant ID attribute under the Basic information section.

  3. Getting the Subscription ID, Resource Group Name, and Account Name:

    1. Go to the Azure OpenAI page from the Azure portal.

    2. Create or select an Azure OpenAI Account.

    3. From the sidebar, click on Overview menu.

    4. You should find these fields under the Essentials section Subscription ID should be available as “Subscription ID” Resource Group Name should be available as “Resource group” Account Name is the name of the selected Azure OpenAI Account.

  1. Once you have connected your Azure OpenAI Models to LLM Labs, you will see a list of available LLM Models that you have already deployed in Azure OpenAI. You can immediately use these models within LLM Labs.

Every new model that you deploy in Azure OpenAI will be synced to Datasaur, and you can use it right away in Datasaur. If the models you just deployed haven't appeared on Datasaur, you can click Sync models.

Supported Azure OpenAI Models

Currently, Datasaur only provides support for text generation Large Language Models.

Additionally, you can bring your own fine-tuned models from Azure OpenAI into Datasaur, allowing for greater customization and performance tailored to your specific use cases.

How to Integrate Azure Open AI Using Certificate

This section provides a step-by-step guide to integrate Azure Open AI with Datasaur's LLM Labs using a certificate for secure authentication. Follow the instructions below to generate credentials, upload the certificate to Azure, and connect Azure Open AI to LLM Labs.

If you are connecting to Azure OpenAI using a certificate, note that only the deployment name that matches the format modelName-modelVersion will be integrated with LLM Labs.

Step 1: Generate Credentials

To integrate Azure Open AI using a certificate, you need to generate a private key, extract the public key, and combine them into a single PEM file.

  1. Generate a private key

    If you don’t have a private key, generate one using the following command:

    openssl genrsa -out private.pem 2048

  2. Extract the public key from the private key

    Use the private key to generate a public key in .pem format:

    openssl req -new -x509 -key private.pem -out public.pem -days 365

  3. Combine keys into one PEM file

    Combine the private and public keys into a single .pem file:

    cat private.pem public.pem > fullkeys.pem

Step 2: Upload the Certificate to Azure App Registration

  1. Navigate to App Registrations

    • Go to Azure Active Directory > App registrations.

    • Select your application.

  2. Upload the certificate

    • In the left menu, go to Certificates & secrets.

    • Under the Certificates section, click Upload certificate.

    • Choose your .pem (public key) file and upload it.

  3. Retrieve Tenant ID and Client ID

    • On your application page, go to the Overview section.

    • Copy the Tenant ID and Client ID. These will be required to connect Azure Open AI to LLM Labs.

    • Ensure your application has access to the Azure Open AI service you want to connect to.

Step 3: Connect Azure Open AI Provider in LLM Labs

  1. Navigate to the Models catalog in your Datasaur workspace, and click the Manage providers button.

  2. Choose Azure OpenAI as the provider, and select Azure tenant application certificate.

  3. Enter your Tenant ID, Client ID, and Azure OpenAI endpoint.

  4. Upload the .pem file containing both the private and public keys (created in Step 1).

Disconnecting Azure OpenAI

To disconnect Azure OpenAI from Datasaur, follow these steps:

  1. Go to the Models page under the LLM Labs menu.

  2. Open My models tab and select Manage providers.

  1. Click See details next to the Azure OpenAI provider.

  1. Click Disconnect and confirm the action.

Availability

The model will be accessible to all workspace members for use in their projects. Additionally, only the Admin can remove the Azure OpenAI provider from the workspace.

Conclusion

Integrating your own Azure OpenAI models with Datasaur not only enhances functionality but also ensures that your data remains secure and compliant with your organization's policies. By leveraging your own LLMs, you gain greater control, customization, and security for your AI applications.

Go to your Microsoft Entra ID from the Azure portal. To learn more about Microsoft Entra ID, please visit this .

Datasaur supports a wide range of models available in Azure OpenAI. You can find the list of supported models here: .

Access Azure Portal: Log in to the .

For further assistance or to get started, please reach out to our support team at .

link
Azure OpenAI Models
Azure Portal
support@datasaur.ai
Manage providers button
Setting up credentials
Request permission
Azure OpenAI Studio Dashboard
Azure OpenAI connected