run-llama · AstraBert · Aug 8, 2025 · Jul 31, 2025
diff --git a/llama-index-integrations/llms/llama-index-llms-heroku/.gitignore b/llama-index-integrations/llms/llama-index-llms-heroku/.gitignore
@@ -0,0 +1,153 @@
+llama_index/_static
+.DS_Store
+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+
+# C extensions
+*.so
+
+# Distribution / packaging
+.Python
+bin/
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+etc/
+include/
+lib/
+lib64/
+parts/
+sdist/
+share/
+var/
+wheels/
+pip-wheel-metadata/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+
+# PyInstaller
+#  Usually these files are written by a python script from a template
+#  before PyInstaller builds the exe, so as to inject date/other infos into it.
+*.manifest
+*.spec
+
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+.ruff_cache
+
+# Translations
+*.mo
+*.pot
+
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+db.sqlite3-journal
+
+# Flask stuff:
+instance/
+.webassets-cache
+
+# Scrapy stuff:
+.scrapy
+
+# Sphinx documentation
+docs/_build/
+
+# PyBuilder
+target/
+
+# Jupyter Notebook
+.ipynb_checkpoints
+notebooks/
+
+# IPython
+profile_default/
+ipython_config.py
+
+# pyenv
+.python-version
+
+# pipenv
+#   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
+#   However, in case of collaboration, if having platform-specific dependencies or dependencies
+#   having no cross-platform support, pipenv may install dependencies that don't work, or not
+#   install all needed dependencies.
+#Pipfile.lock
+
+# PEP 582; used by e.g. github.com/David-OConnor/pyflow
+__pypackages__/
+
+# Celery stuff
+celerybeat-schedule
+celerybeat.pid
+
+# SageMath parsed files
+*.sage.py
+
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+pyvenv.cfg
+
+# Spyder project settings
+.spyderproject
+.spyproject
+
+# Rope project settings
+.ropeproject
+
+# mkdocs documentation
+/site
+
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+
+# Pyre type checker
+.pyre/
+
+# Jetbrains
+.idea
+modules/
+*.swp
+
+# VsCode
+.vscode
+
+# pipenv
+Pipfile
+Pipfile.lock
+
+# pyright
+pyrightconfig.json
diff --git a/llama-index-integrations/llms/llama-index-llms-heroku/LICENSE b/llama-index-integrations/llms/llama-index-llms-heroku/LICENSE
@@ -0,0 +1,21 @@
+The MIT License
+
+Copyright (c) Jerry Liu
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in
+all copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
+THE SOFTWARE.
diff --git a/llama-index-integrations/llms/llama-index-llms-heroku/Makefile b/llama-index-integrations/llms/llama-index-llms-heroku/Makefile
@@ -0,0 +1,31 @@
+.PHONY: install
+install:
+	pip install -e .
+
+.PHONY: install-dev
+install-dev:
+	pip install -e ".[dev]"
+
+.PHONY: format
+format:
+	black llama_index tests
+	ruff check --fix llama_index tests
+
+.PHONY: lint
+lint:
+	ruff check llama_index tests
+	mypy llama_index
+
+.PHONY: test
+test:
+	pytest tests/ -v
+
+.PHONY: test-cov
+test-cov:
+	pytest tests/ --cov=llama_index --cov-report=term-missing
+
+.PHONY: clean
+clean:
+	rm -rf build/
+	rm -rf dist/
+	rm -rf *.egg-info
diff --git a/llama-index-integrations/llms/llama-index-llms-heroku/README.md b/llama-index-integrations/llms/llama-index-llms-heroku/README.md
@@ -0,0 +1,121 @@
+# Heroku Managed Inference
+
+The `llama-index-llms-heroku` package provides the LlamaIndex integration for Heroku's Managed Inference platform. It lets you connect to and use AI models provisioned on Heroku from your LlamaIndex applications.
+
+## Installation
+
+```shell
+pip install llama-index-llms-heroku
+```
+
+## Setup
+
+### 1. Create a Heroku App
+
+First, create an app in Heroku:
+
+```bash
+heroku create $APP_NAME
+```
+
+### 2. Create and Attach AI Models
+
+Create and attach a chat model to your app:
+
+```bash
+heroku ai:models:create -a $APP_NAME claude-3-5-haiku
+```
+
+### 3. Export Configuration Variables
+
+Export the required configuration variables:
+
+```bash
+export INFERENCE_KEY=$(heroku config:get INFERENCE_KEY -a $APP_NAME)
+export INFERENCE_MODEL_ID=$(heroku config:get INFERENCE_MODEL_ID -a $APP_NAME)
+export INFERENCE_URL=$(heroku config:get INFERENCE_URL -a $APP_NAME)
+```
+
+## Usage
+
+### Basic Usage
+
+```python
+from llama_index.llms.heroku import Heroku
+from llama_index.core.llms import ChatMessage, MessageRole
+
+# Initialize the Heroku LLM
+llm = Heroku()
+
+# Create chat messages
+messages = [
+    ChatMessage(
+        role=MessageRole.SYSTEM, content="You are a helpful assistant."
+    ),
+    ChatMessage(
+        role=MessageRole.USER,
+        content="What are the most popular house pets in North America?",
+    ),
+]
+
+# Get response
+response = llm.chat(messages)
+print(response)
+```
+
+### Using Environment Variables
+
+When no parameters are passed, the integration reads its configuration from the `INFERENCE_KEY`, `INFERENCE_URL`, and `INFERENCE_MODEL_ID` environment variables:
+
+```python
+import os
+
+from llama_index.llms.heroku import Heroku
+
+# Set environment variables
+os.environ["INFERENCE_KEY"] = "your-inference-key"
+os.environ["INFERENCE_URL"] = "https://us.inference.heroku.com"
+os.environ["INFERENCE_MODEL_ID"] = "claude-3-5-haiku"
+
+# Initialize without parameters
+llm = Heroku()
+```
+
+### Using Parameters
+
+You can also pass parameters directly:
+
+```python
+import os
+
+from llama_index.llms.heroku import Heroku
+
+llm = Heroku(
+    model=os.getenv("INFERENCE_MODEL_ID", "claude-3-5-haiku"),
+    api_key=os.getenv("INFERENCE_KEY", "your-inference-key"),
+    inference_url=os.getenv(
+        "INFERENCE_URL", "https://us.inference.heroku.com"
+    ),
+    max_tokens=1024,
+)
+```
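+
+If you prefer to build the keyword arguments once and reuse them, the mapping from environment variables to parameters can be sketched as below. This is a minimal sketch, not part of the package: `heroku_kwargs_from_env` and `ENV_TO_PARAM` are hypothetical names, and the parameter names (`model`, `api_key`, `inference_url`) are taken from the example above.
+
+```python
+import os
+from typing import Dict, Mapping, Optional
+
+# Environment variable -> constructor keyword, as used in the example above.
+ENV_TO_PARAM = {
+    "INFERENCE_MODEL_ID": "model",
+    "INFERENCE_KEY": "api_key",
+    "INFERENCE_URL": "inference_url",
+}
+
+
+def heroku_kwargs_from_env(
+    env: Optional[Mapping[str, str]] = None,
+) -> Dict[str, str]:
+    """Collect the settings that are present and non-empty (hypothetical helper)."""
+    source = os.environ if env is None else env
+    return {
+        param: source[name]
+        for name, param in ENV_TO_PARAM.items()
+        if source.get(name)  # skip unset or empty variables
+    }
+```
+
+With the package installed, the result could then be passed on as `Heroku(**heroku_kwargs_from_env(), max_tokens=1024)`.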
+
+### Text Completion
+
+```python
+# Simple text completion (reuses the `llm` instance created above)
+response = llm.complete("Explain the importance of open source LLMs")
+print(response.text)
+```
+
+## Available Models
+
+For a complete list of available models, see the [Heroku Managed Inference documentation](https://devcenter.heroku.com/articles/heroku-inference#available-models).
+
+## Error Handling
+
+The integration handles the following common configuration problems:
+
+- Missing API key
+- Invalid inference URL
+- Missing model configuration
+
+## Additional Information
+
+For more information about Heroku Managed Inference, visit the [official documentation](https://devcenter.heroku.com/articles/heroku-inference).