# Installation

> Install PyMuPDF4LLM and its optional dependencies.

<div id="apiIndicatorBadge">
  <div class="inner pymupdf" />
</div>

## Requirements

PyMuPDF4LLM requires **Python 3.8+**. It is built on top of [PyMuPDF](https://pymupdf.readthedocs.io/), which is installed automatically as a dependency.
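As a quick pre-install check, the following minimal sketch confirms that the running interpreter meets the 3.8 minimum stated above. The helper name `meets_minimum` is hypothetical and is not part of PyMuPDF4LLM; it uses only the standard library.

```python
import sys

# Minimum version taken from the requirement stated above.
MINIMUM_PYTHON = (3, 8)


def meets_minimum(version_info=None, minimum=MINIMUM_PYTHON):
    """Return True if the given (or running) interpreter is at least `minimum`.

    Hypothetical helper for illustration; not part of PyMuPDF4LLM.
    """
    if version_info is None:
        version_info = sys.version_info
    # Compare only (major, minor); tuple comparison is lexicographic.
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    status = "OK" if meets_minimum() else "too old"
    print(f"Python {current}: {status}")
```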

***

## Basic Installation

Install PyMuPDF4LLM from PyPI using pip:

```bash theme={null}
pip install pymupdf4llm
```

This gives you full access to Markdown, JSON, and plain text extraction from document files.

***

## Optional Dependencies

### OCR Support

Enables automatic Optical Character Recognition for PDFs containing scanned or image-based content. [Tesseract](https://github.com/tesseract-ocr/tesseract) is included by default.

[Rapid OCR](https://github.com/RapidAI/RapidOCR) and [Paddle OCR](https://github.com/PaddlePaddle/PaddleOCR) are also supported as optional OCR engines. Install them separately if you need them.

<Warning>
  OCR is only triggered automatically when PyMuPDF4LLM detects that a page requires it.

  See:

  * [Hybrid OCR Strategy](/python/guides/OCR#hybrid-ocr-strategy)
  * [How OCR is Triggered](/python/guides/OCR#how-ocr-is-triggered)
</Warning>
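To see whether an optional OCR engine is present in the current environment, a sketch along the following lines may help. The import names `rapidocr_onnxruntime` and `paddleocr` are assumptions based on how those projects are commonly packaged, not names confirmed by this page, and `available_modules` is a hypothetical helper. Adjust the names to match whatever you actually installed.

```python
import importlib.util

# ASSUMPTION: these import names are guesses based on the projects' usual
# packaging; they are not taken from the PyMuPDF4LLM documentation.
OPTIONAL_OCR_MODULES = {
    "Rapid OCR": "rapidocr_onnxruntime",
    "Paddle OCR": "paddleocr",
}


def available_modules(candidates):
    """Map each label to True if its top-level module can be located.

    Hypothetical helper: it only checks that the module is findable on the
    import path; it does not import it or test that OCR actually works.
    """
    return {
        label: importlib.util.find_spec(module) is not None
        for label, module in candidates.items()
    }


if __name__ == "__main__":
    for label, found in available_modules(OPTIONAL_OCR_MODULES).items():
        print(f"{label}: {'found' if found else 'not found'}")
```

This only reports whether a module can be found; whether PyMuPDF4LLM uses a given engine is covered in the OCR guide linked above.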

***

## Verify Your Installation

```python theme={null}
import pymupdf4llm

print(pymupdf4llm.version)
```
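If you would rather check without importing the package, one alternative is to query the installed distribution metadata through the standard library. This is a minimal sketch: `installed_version` is a hypothetical helper, and the distribution names are assumed to match the PyPI names (`pymupdf4llm` from the pip command above, `PyMuPDF` for the dependency).

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version string for a distribution, or None.

    Hypothetical helper for illustration; it reads package metadata and
    does not import the package itself.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution names are assumed to match the names used on PyPI.
    for name in ("pymupdf4llm", "PyMuPDF"):
        version = installed_version(name)
        print(f"{name}: {version if version is not None else 'not installed'}")
```

A result of `not installed` for `pymupdf4llm` usually means that pip installed into a different environment from the one running the script.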

***

## Next Steps

<CardGroup cols={2}>
  <Card title="Quickstart" icon="bolt" href="/python/getting-started/quickstart">
    Convert your first PDF to Markdown in a few lines.
  </Card>

  <Card title="Supported Formats" icon="file" href="/python/getting-started/supported-formats">
    See all supported input and output formats.
  </Card>
</CardGroup>
