langbrain/LangChain.md

> [Obsidian](https://obsidian.md/) is a powerful and extensible knowledge base that works on top of your local folder of plain text files.

This notebook covers how to load documents from an `Obsidian` database.

Because an `Obsidian` database is stored on disk as a folder of Markdown files, the loader only needs the path to that directory.

`Obsidian` files sometimes also contain [metadata](https://help.obsidian.md/Editing+and+formatting/Metadata), a YAML block at the top of the file. These values are added to the document's metadata. (Pass `collect_metadata=False` to `ObsidianLoader` to disable this behavior.)
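```
# Newer LangChain releases expose this loader from
# langchain_community.document_loaders instead.
from langchain.document_loaders import ObsidianLoader

# Point the loader at the folder that holds the Markdown notes.
loader = ObsidianLoader("<path-to-obsidian>")

# Read every note into a list of Document objects.
docs = loader.load()
```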
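To make the metadata step concrete, here is a minimal, standalone sketch of how a YAML block at the top of a note could be separated from the body. It is not `ObsidianLoader`'s actual implementation: `split_front_matter` is a hypothetical helper, and it assumes the block is delimited by `---` lines and contains only flat `key: value` pairs.

```python
from typing import Dict, Tuple


def split_front_matter(text: str) -> Tuple[Dict[str, str], str]:
    """Split a note into (metadata, body).

    Hypothetical helper for illustration only; it handles flat
    `key: value` lines, not full YAML.
    """
    lines = text.splitlines()
    # Front matter must start on the very first line with a `---` delimiter.
    if not lines or lines[0].strip() != "---":
        return {}, text
    # Look for the closing `---` delimiter.
    for end in range(1, len(lines)):
        if lines[end].strip() == "---":
            break
    else:
        # No closing delimiter: treat the whole file as body.
        return {}, text
    metadata: Dict[str, str] = {}
    for line in lines[1:end]:
        key, sep, value = line.partition(":")
        if sep and key.strip():
            metadata[key.strip()] = value.strip()
    body = "\n".join(lines[end + 1:])
    return metadata, body


# Made-up sample note used only for this illustration.
note = """---
tags: langchain
author: me
---
# My note
Some text.
"""

metadata, body = split_front_matter(note)
print(metadata)
print(body)
```

In the real loader, values like these end up in each document's `metadata` alongside the page content, unless `collect_metadata=False` is passed.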
[source](https://python.langchain.com/docs/integrations/document_loaders/obsidian)
## Document loaders
[source](https://python.langchain.com/docs/modules/data_connection/document_loaders/)
## MarkdownHeaderTextSplitter
[source](https://python.langchain.com/docs/modules/data_connection/document_transformers/text_splitters/markdown_header_metadata)