Connect your data

Data is at the heart of every successful machine learning project. Without it, there can be no insight, no prediction, and no progress. That's why Octai makes it easy to connect to the data sources you need. There are 120+ data connectors available, and if we miss your required connector, we develop it for you for free.

You can access various data sources, such as local files like Excel sheets, CSV files; databases like PostgreSQL, MySQL; cloud resources like AWS S3, Google Cloud Storage, BigQuery, or big data clusters like Apache Hadoop.

But that's not all. Our marketplace offers a range of pre-built datasets, including weather files, to help you get started with your projects quickly and efficiently.

In this guide, we'll walk you through the steps to connect your datasets, so you can get started on building your next machine learning project with ease. Whether you're a seasoned data scientist or a beginner in the field, Octai has got you covered.

Local Files

Simply drag & drop your local file and Octai automatically handles the rest.

You can upload any file with these extensions

  • csv
  • tsv
  • xlx
  • xlsx
  • feather
  • hdf
  • parquet

or you can upload archives of those in these formats

  • zip
  • rar
  • 7z
  • tar
  • gz

Connectors

Access to data from databases, cloud resources and marketing tools is available. This is an example connection to a PostgresDB,

Here is the full list of connectors that is available on Octai. Contact us if your connector is not listed here, and we will develop it for you for free.

  • AWS CloudTrail
  • Airtable
  • AlloyDB for PostgreSQL
  • Amazon Ads
  • Amazon SQS
  • Amazon Seller Partner
  • Amplitude
  • Apify Dataset
  • Asana
  • Auth0
  • Azure Table Storage
  • BambooHR
  • BigCommerce
  • BigQuery
  • Bing Ads
  • Braintree
  • Braze
  • Chargebee
  • Chartmogul
  • ClickHouse
  • ClickUp
  • Close.com
  • Coda
  • Coin API
  • CoinMarketCap
  • ConfigCat
  • Datascope
  • Delighted
  • Dixa
  • Dockerhub
  • Dremio
  • DynamoDB
  • EmailOctopus
  • End-to-End Testing (Mock API)
  • Exchange Rates Api
  • Facebook Marketing
  • Facebook Pages
  • Fauna
  • File (CSV, JSON, Excel, Feather, Parquet)
  • Firebolt
  • Freshcaller
  • Freshdesk
  • GNews
  • GetLago
  • GitHub
  • Gitlab
  • Glassfrog
  • Google Ads
  • Google Analytics (Universal Analytics)
  • Google Analytics 4 (GA4)
  • Google Directory
  • Google Search Console
  • Google Sheets
  • Google Webfonts
  • Google Workspace Admin Reports
  • Greenhouse
  • Gridly
  • Harvest
  • HubSpot
  • Hubplanner
  • IP2Whois
  • Insightly
  • Instagram
  • Instatus
  • Intercom
  • Iterable
  • Jira
  • K6 Cloud
  • Klarna
  • Klaviyo
  • Kustomer
  • LaunchDarkly
  • Lemlist
  • LinkedIn Ads
  • Linnworks
  • Lokalise
  • Looker
  • Mailchimp
  • Mailgun
  • Mailjet SMS
  • Marketo
  • Metabase
  • Microsoft SQL Server (MSSQL)
  • Microsoft teams
  • Mixpanel
  • Monday
  • MongoDb
  • My Hours
  • MySQL
  • Netsuite
  • New York Times
  • Notion
  • Okta
  • Omnisend
  • OpenWeather
  • Oracle DB
  • Orb
  • Orbit
  • Paypal Transaction
  • PersistIq
  • Pexels API
  • Pinterest
  • Pipedrive
  • Pocket
  • PokeAPI
  • Polygon Stock API
  • PostHog
  • Postgres
  • Postmark App
  • PrestaShop
  • Public APIs
  • Punk API
  • PyPI
  • Qualaroo
  • RKI Covid
  • Railz
  • Recharge
  • Recruitee
  • Recurly
  • Redshift
  • Retently
  • S3
  • SAP Fieldglass
  • SFTP
  • SFTP Bulk
  • SalesLoft
  • Salesforce
  • Salesforce (Singer)
  • Sample Data (Faker)
  • Secoda
  • Sendgrid
  • Sendinblue
  • Senseforce
  • Sentry
  • Shopify
  • Short.io
  • Slack
  • Smaily
  • SmartEngage
  • Smartsheets
  • Snapchat Marketing
  • Snowflake
  • Sonar Cloud
  • SpaceX API
  • Square
  • Stripe
  • SurveyMonkey
  • SurveySparrow
  • TVMaze Schedule
  • Tempo
  • The Guardian API
  • TikTok Marketing
  • Twilio
  • Twilio Taskrouter
  • Twitter
  • Typeform
  • US Census
  • Vantage
  • Webflow
  • Whisky Hunter
  • Wikipedia Pageviews
  • WooCommerce
  • YouTube Analytics
  • YouTube Analytics Business
  • Younium
  • Zendesk Chat
  • Zendesk Sunshine
  • Zendesk Support
  • Zendesk Talk
  • Zenloop
  • Zoom
  • Zuora
  • xkcd

Marketplace

Our marketplace also offers a range of pre-built weather datasets, enabling you to harness the power of weather data in your machine learning projects. By inputting your desired location, time range, and weather parameters, we can provide you with the data you need to build accurate and insightful models. Whether you're looking to predict energy demand, optimize transportation routes, or mitigate weather-related risks, our weather files give you the data you need to succeed. With Octai, you can leverage the latest weather data to gain a competitive edge in your industry.