Q: Prefered format source documents
Hi,
Does it matter to Socrates what filetype the documents are in?
Like pdf versus Word versus excel or epub or Markdown?
And what limits are there? I mean what if I have textbook pdf with like thousand pages. In other ai tool I splitted that document into like 12 chapters to prevent ai hallucinating. Is that something you should do with Socrates too? And if so, at what size should you do that?
Jon_Socrates
Nov 23, 2024A: Hello, Socrates supports all those document types, but PDFs have a bit more structure and thus can provide additional context to LLMs to give a slightly better result (we auto-convert DOCX files to PDF during Deep Dives).
Yes, we handle auto-splitting of long PDFs, so you don't have to worry about the size of the documents.