GPT4All - Extract Information from PDF Bank Statements into JSON

<p>Extract data from PDF bank statements into JSON files with the help of GPT4All and a Llama3 LLM.</p><ul><li><p>List the PDFs on your drive that share a roughly similar layout, so that an LLM can be expected to extract their data in a systematic way</p></li><li><p>Formulate a concise prompt (and instruction) that pushes the LLM to always return JSON with the same structure (Mistral seems to be very good at that)</p></li><li><p>Use the GPT4All wrapper to pass each document together with the query to the LLM</p></li><li><p>Collect the responses</p></li><li><p>Extract the data from the JSON responses, either with the help of Regex or by converting the JSON with KNIME nodes</p></li><li><p>Make sure the results all have the same structure</p></li></ul><p>Note: you need to have GPT4All installed and a suitable model downloaded to your "gpt4all_models" folder. You can then choose the model in the component.</p>
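The prompt and post-processing steps above can be sketched in plain Python. This is a minimal illustration, not the workflow's actual implementation: the field names in `EXPECTED_KEYS` and the helper names `build_prompt`, `extract_json` and `normalize` are hypothetical, and the model call itself is left out because it depends on how GPT4All is invoked in your setup.

```python
import json
import re

# Hypothetical target schema; these field names are illustrative only
# and are not taken from the workflow.
EXPECTED_KEYS = ("account_holder", "iban", "statement_date", "closing_balance")


def build_prompt(document_text: str) -> str:
    """Combine a fixed instruction with the text of one bank statement."""
    keys = ", ".join(f'"{key}"' for key in EXPECTED_KEYS)
    return (
        "Extract the following fields from the bank statement below. "
        f"Answer with a single JSON object that has exactly these keys: {keys}. "
        "Use null for anything you cannot find.\n\n"
        f"Bank statement:\n{document_text}"
    )


def extract_json(response: str) -> dict:
    """Parse the text between the first '{' and the last '}' as JSON.

    LLMs often wrap the JSON in extra chatter, so the surrounding text
    is discarded before parsing.
    """
    match = re.search(r"\{.*\}", response, flags=re.DOTALL)
    if match is None:
        raise ValueError("no JSON object found in response")
    return json.loads(match.group(0))


def normalize(record: dict) -> dict:
    """Force a record into the expected key set.

    Missing keys become None and unexpected keys are dropped, so every
    response ends up with the same structure.
    """
    return {key: record.get(key) for key in EXPECTED_KEYS}
```

In use, the string returned by the LLM for each PDF would be passed through `normalize(extract_json(response))`, and the resulting dictionaries could be written out as JSON files or collected into a table. A response that contains no parseable JSON raises an exception, which makes failed extractions easy to spot and rerun.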

URL: Medium - Ollama and Llama3 - A Streamlit App to convert your files into local Vector Stores and chat with them using the latest LLMs https://medium.com/p/c5340fcd6ad0
URL: Medium - Chat with local Llama3 Model via Ollama in KNIME Analytics Platform - Also extract Logs into structured JSON Files https://medium.com/p/aca61e4a690a
URL: GitHub - A Streamlit App to convert your files into Vector Stores and chat with them with LLMs https://github.com/ml-score/ollama
URL: KNIME Spaces: KNIME for Generative AI https://hub.knime.com/-/collections/~D4ckx2q_J5FPBQXu/
URL: Download the whole LLM workflow group in order to get the folder https://hub.knime.com/mlauber71/spaces/LLM_Space/~17k4zAECNryrZw1X/
URL: GPT4All - Run Large Language Models Locally https://www.nomic.ai/gpt4all
URL: Medium: Llama3 and KNIME — Build your local Vector Store from PDFs and other Documents https://medium.com/low-code-for-advanced-data-science/llama3-and-knime-build-your-local-vector-store-from-pdfs-and-other-documents-237eda761c1c
