UltiHash documentation
← back to ultihash.io
  • Get started with UltiHash
  • Get help + troubleshooting
  • Installation
    • Test with Docker
    • Install Self-Hosted on-premises
    • Install Self-Hosted on AWS
    • Set up UltiHash Serverless
    • Migrate your data
  • Operations
    • Upload + download data
    • Use the S3-compatible API
    • Prebuilt connections
      • Airflow
      • AWS Glue
      • Iceberg
      • Icechunk
      • Kafka
      • Neo4j
      • Presto
      • PySpark
      • PyTorch
      • SuperAnnotate
      • Trino
      • Vector databases
    • Set up pre-signed URLs
    • Save space with deduplication
    • Delete stored data
    • Set up object versioning
  • Administration
    • Customize your deployment
    • Monitor your cluster
    • Scale your cluster
    • Update your cluster
    • Backup + restore your cluster
    • Manage users + access policies
    • Erasure coding for data resiliency
    • Set up encryption
  • Reference
    • Changelog
      • Core image
      • Helm chart
Powered by GitBook
On this page
  1. Operations

Prebuilt connections

How to connect UltiHash to the rest of your stack

Last updated 28 days ago

Was this helpful?

CtrlK

Was this helpful?

UltiHash offers a powerful S3-compatible API for connecting to a huge range of tools. Below are a selection of tools with tested custom integrations; many more can be easily connected via the API.

Cover

Airflow

Programmatically author, schedule and monitor workflows

Cover

AWS Glue

Event-driven, serverless data integration service

Cover

Delta Lake

Open-source storage framework for building lakehouses

Cover

Iceberg

High-performance format for huge analytic tables

Cover

Icechunk

Storage engine for tensor / ND-array data

Cover

Kafka

Distributed event streaming platform

Cover

Neo4j

Connect a graph database and retrieve raw data

Cover

Presto

Scalable SQL query engine for modern data analytics

Cover

PySpark

Open-source analytics for large-scale data processing

Cover

PyTorch

Library for deep learning on irregular inputs

Cover

SuperAnnotate

Centralize AI data needs and vendor management

Cover

Trino

Distributed SQL query engine for big data analytics

Cover

Vector databases

Connect a vector DB and retrieve raw data based on queries