Theundatasiopackage from UnDatasIO extracts clean text from raw source documents like PDFs. This page covers how to use theundatasioecosystem within LangChain.
Installation and Setup
-
Install the Python SDK with
pip install undatasio
along with
pip install langchain-undatasio
to use theUnDatasIOLoaderand partition remotely against the UnDatasIO API. You will need an API key, which you can generate for free at
undatas.io. - No local system dependencies are required; all processing runs in the cloud.
Data Loaders
The primary usage ofUnDatasIO is through the document loader.