Q: Prefered format source documents

Hi,
Does it matter to Socrates what filetype the documents are in?

Like pdf versus Word versus excel or epub or Markdown?

And what limits are there? I mean what if I have textbook pdf with like thousand pages. In other ai tool I splitted that document into like 12 chapters to prevent ai hallucinating. Is that something you should do with Socrates too? And if so, at what size should you do that?

EZnlNov 22, 2024
Founder Team
Jon_Socrates

Jon_Socrates

Nov 23, 2024

A: Hello, Socrates supports all those document types, but PDFs have a bit more structure and thus can provide additional context to LLMs to give a slightly better result (we auto-convert DOCX files to PDF during Deep Dives).

Yes, we handle auto-splitting of long PDFs, so you don't have to worry about the size of the documents.

Share
Helpful?
Log in to join the conversation