File Formats Supported By TextCortex
TextCortex is engineered to work seamlessly with a range of text document formats. Some of the most commonly supported file types include:
- Text: .txt, .md, .html
- Documents: PDF, Word (.docx), PowerPoint (.pptx)
- Image: .png, .jpg (can only be uploaded in chat)
- Data: Excel (.xml, .xlsx), CSV
- Integration-specific sources: Web URLs, SharePoint Sites/Pages (.aspx), Google Docs, Google Sheets..
Size limits:
- Regular files: 300MB maximum per file
- Tabular data (Excel, CSV): 10 million character limit across all rows, columns, and sheets combined
Content density considerations:
Even files under 300MB may fail to upload if they contain extremely dense text content that exceeds processing limits. Files with dense formatting, extensive metadata, or highly compressed text (such as PDFs with small fonts or tables with thousands of entries) can sometimes hit character processing limits before reaching the file size limit.
If you encounter upload issues with smaller files, try:
- Breaking large documents into smaller sections
- Simplifying formatting or removing unnecessary metadata
- For spreadsheets, splitting data across multiple files if row/column count is very high
These formats are ideal for TextCortex because they are primarily text-based and allow the platform to analyze and process your written content effectively.
Integrations
To make the process of uploading files to your knowledge base easier, we also give users the option to connect third-party accounts such as Google Drive, Microsoft OneDrive and many more to come. These integrations allow users to import all of their supported files seamlessly with just one click.
See integrations for more information about Google Drive and Microsoft OneDrive integrations.