FAQs

Technical FAQs for Data Distill

  • Data Distill uses API Key Authentication for secure access. You include your API key in the request headers:

  • Send a document for asynchronous processing by specifying a callback_url in your request payload. Results will be sent to your webhook when processing is complete.
    Example Payload:

  • Data Distill supports:
    • Documents: PDF, DOCX, TXT
    • Images: JPG, PNG, TIFF
    • Audio: MP3, WAV
  • Data Distill’s API automatically detects and extracts tables, including nested tables andcomplex layouts. You receive the data in JSON format, preserving the table structure.

  • The maximum file size limit depends on your plan:
    • Starter: 5 MB
    • Business: 20 MB
    • Premium & Enterprise: 100 MB
    For larger files, split them into smaller parts before uploading.
  • Check the status codes and error messages in the API response. Common errors include:
    • 400: Bad request (invalid payload)
    • 401: Unauthorized (Invalid API Key)
    • 500: Internal server error
    Example Error Response:
  • The maximum file size limit depends on your plan:
    • Batch Processing: Submit multiple documents in a single API call.
    • Asynchronous Calls: Use asynchronous processing for large-scale tasks.
    • Webhooks: Receive real-time results via webhooks to avoid polling.
  • Rate limits vary by plan. If you exceed the limit, the API returns a 429 (Too Many Requests) response. Implement retry logic with exponential backoff.
    Example Retry Logic in Python:

  • Include a custom template in your API request to specify the fields you need
    Example Payload:

  • Secure your webhook by:
    • Using HTTPS for encrypted communication.
    • Verifying Signatures: Check the signature included in the webhook payload to confirm authenticity.
  • Send an audio file to the transcription API endpoint and specify your callback_url.Example Request:

  • Data Distill supports over 60 languages for OCR and transcription, including English, Spanish, French, German, Chinese, Japanese, and more.

  • SYou can monitor your credit usage via the dashboard or by making an API call to the usage endpoint.Example Usage Request:

  • Data Distill is compliant with:
    • GDPR
    • HIPAA
    • CCPA We process data securely with end-to-end encryption and do not store any data. For maximum compliance, you can request a BAA (Business Associate Agreement) for HIPAA-related use cases.

Need More Technical Information?