langbrain/LangChain.md

> [Obsidian](https://obsidian.md/) is a powerful and extensible knowledge base that works on top of your local folder of plain text files.

This notebook covers how to load documents from an `Obsidian` database.

Because an `Obsidian` database is stored on disk as a plain folder of Markdown files, the loader takes only the path to that directory.

`Obsidian` files sometimes also contain [metadata](https://help.obsidian.md/Editing+and+formatting/Metadata), a YAML block at the top of the file. These values are added to the document's metadata. To disable this behavior, pass `collect_metadata=False` to `ObsidianLoader`.
```
from langchain.document_loaders import ObsidianLoader
```
```
loader = ObsidianLoader("<path-to-obsidian>")
```
```
docs = loader.load()
```
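
To make the behavior described above concrete, here is a minimal standard-library sketch of the idea: walk a folder of Markdown files and lift a leading `---` block into each document's metadata. It is not `ObsidianLoader`'s actual implementation. The names `split_front_matter` and `load_folder` are hypothetical, the parser only handles flat `key: value` lines rather than full YAML, and the `collect_metadata` flag merely mimics the option mentioned above.

```python
from pathlib import Path
import tempfile


def split_front_matter(text):
    """Split a leading '---' delimited block from the body (flat key: value only)."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            fields = {}
            for line in lines[1:i]:
                key, sep, value = line.partition(":")
                if sep:
                    fields[key.strip()] = value.strip()
            return fields, "\n".join(lines[i + 1:])
    # No closing delimiter: treat the whole file as ordinary content.
    return {}, text


def load_folder(path, collect_metadata=True):
    """Return one {"page_content", "metadata"} dict per Markdown file under path."""
    docs = []
    for md_file in sorted(Path(path).rglob("*.md")):
        text = md_file.read_text(encoding="utf-8")
        metadata = {"source": md_file.name}
        if collect_metadata:
            fields, text = split_front_matter(text)
            metadata.update(fields)
        docs.append({"page_content": text, "metadata": metadata})
    return docs


# Tiny demonstration on a throwaway folder containing a single note.
with tempfile.TemporaryDirectory() as folder:
    Path(folder, "note.md").write_text(
        "---\ntags: demo\n---\nHello from the folder.\n", encoding="utf-8"
    )
    docs = load_folder(folder)
    print(docs[0]["metadata"])
    print(docs[0]["page_content"])
```

In this sketch, calling `load_folder(folder, collect_metadata=False)` leaves the front matter inside `page_content` and records only the file name as metadata; whether the real loader treats the body the same way should be checked against its documentation.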
[source](https://python.langchain.com/docs/integrations/document_loaders/obsidian)
## Document loaders
[source](https://python.langchain.com/docs/modules/data_connection/document_loaders/)
## MarkdownHeaderTextSplitter
[source](https://python.langchain.com/docs/modules/data_connection/document_transformers/text_splitters/markdown_header_metadata)