ive been messing around a bunch with semantic vectorization and vector search and honestly its been really easy to get started with doing stuff using ChromaDB and sentence-transformers
the hardest thing I need to figure out now is how to actually get documents / PDFs into a searchable form... I'm not sure if Chroma does this automatically or if I need to chunk my embeddings