Cleanup
aravind10x committed Dec 30, 2024
1 parent d1f044f commit 1d79b77
Showing 17 changed files with 62 additions and 48,555 deletions.
3 changes: 0 additions & 3 deletions Brewfile

This file was deleted.

75 changes: 62 additions & 13 deletions README.md
@@ -44,7 +44,7 @@ See other installation options here ([link](https://docs.ragbuilder.io/quickstar
```python
from ragbuilder import RAGBuilder

# Initialize and optimize
# Initialize and optimize with defaults
builder = RAGBuilder.from_source_with_defaults(input_source='https://lilianweng.github.io/posts/2023-06-23-agent/')
results = builder.optimize()

@@ -55,6 +55,29 @@ response = results.invoke("What is HNSW?")
print(results.summary())
```
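The quickstart passes a URL as `input_source`, while the next example in this README passes a local file (`data.pdf`). As a rough illustration of how a source could be matched to one of the loader types listed under the Component Options Reference (`web`, `pymupdf`, `unstructured`), here is a minimal sketch. `suggest_loader` is a hypothetical helper written for this note; it is not part of ragbuilder, which may well select loaders on its own.

```python
from pathlib import PurePath
from urllib.parse import urlparse


def suggest_loader(input_source: str) -> str:
    """Suggest a loader type name for an input source (hypothetical helper)."""
    parsed = urlparse(input_source)
    # Treat http(s) sources that include a host as web pages.
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "web"
    # Local PDF files get the PDF-optimized loader.
    if PurePath(input_source).suffix.lower() == ".pdf":
        return "pymupdf"
    # Everything else falls back to the general-purpose loader.
    return "unstructured"


print(suggest_loader("https://lilianweng.github.io/posts/2023-06-23-agent/"))
print(suggest_loader("data.pdf"))
print(suggest_loader("notes.docx"))
```

The URL check runs first, so a remote PDF would still be classified as `web` in this sketch; whether that is the right choice depends on how the real loaders handle remote files.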

### Setting Default Models

You can specify default LLM and embedding models that will be used throughout the pipeline:

`````python
from langchain_openai import AzureChatOpenAI, AzureOpenAIEmbeddings
from ragbuilder import RAGBuilder

# Initialize with custom defaults
builder = RAGBuilder.from_source_with_defaults(
    input_source='data.pdf',
    default_llm=AzureChatOpenAI(model="gpt-4o", temperature=0.0),
    default_embeddings=AzureOpenAIEmbeddings(model="text-embedding-3-large"),
    n_trials=20  # Number of optimization trials
)

# Or, when creating a RAGBuilder instance with fine-grained custom configuration
# (data_ingest_config is a DataIngestOptionsConfig; see the Configuration Guide)
builder = RAGBuilder(
    data_ingest_config=data_ingest_config,  # Custom data ingestion parameters
    default_llm=AzureChatOpenAI(model="gpt-4o", temperature=0.0),
    default_embeddings=AzureOpenAIEmbeddings(model="text-embedding-3-large")
)
`````
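One way to picture "defaults used throughout the pipeline" is a simple fallback: a component uses its own model when one is configured and otherwise falls back to the builder-level default. The sketch below shows that pattern in plain Python. `PipelineDefaults` and `resolve_model` are hypothetical names, models are represented as plain strings for brevity, and per-component overrides are an assumption; none of this reflects ragbuilder's actual internals.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class PipelineDefaults:
    """Builder-level default model names (hypothetical container)."""
    llm: str
    embeddings: str


def resolve_model(component_override: Optional[str], default: str) -> str:
    """Return the component's own model if set, else the pipeline default."""
    return component_override if component_override is not None else default


defaults = PipelineDefaults(llm="gpt-4o", embeddings="text-embedding-3-large")

# A component that sets its own LLM keeps it.
generation_llm = resolve_model("gpt-4o-mini", defaults.llm)
# A component with no embedding model configured inherits the default.
retrieval_embeddings = resolve_model(None, defaults.embeddings)

print(generation_llm)
print(retrieval_embeddings)
```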

## Configuration Guide

### Basic Configuration
@@ -96,7 +119,18 @@ data_ingest_config = DataIngestOptionsConfig(
}]
)

# Configure retrieval
# Initialize with custom configs
builder = RAGBuilder(
    data_ingest_config=data_ingest_config,
    default_llm=AzureChatOpenAI(model="gpt-4o", temperature=0.0),
    default_embeddings=AzureOpenAIEmbeddings(model="text-embedding-3-large")
)

# Run individual module-level optimization
builder.optimize_data_ingest()


# Configure retrieval options
retrieval_config = RetrievalOptionsConfig(
retrievers=[
{
@@ -116,25 +150,40 @@ retrieval_config = RetrievalOptionsConfig(
top_k=[3, 5]
)

# Initialize with custom configs
builder = RAGBuilder(
data_ingest_config=data_ingest_config,
retrieval_config=retrieval_config

# Run retrieval optimization with custom config
builder.optimize_retrieval(retrieval_config)

# Configure generation-related options
gen_config = GenerationOptionsConfig(
    llms=[
        LLMConfig(type="azure_openai", model_kwargs={'model': 'gpt-4o-mini', 'temperature': 0.2}),
        LLMConfig(type="azure_openai", model_kwargs={'model': 'gpt-4o', 'temperature': 0.2}),
    ],
    optimization={
        "n_trials": 10,
        "n_jobs": 1,
        "study_name": "lillog_agents_study",
        "optimization_direction": "maximize"
    },
    evaluation_config={"type": "ragas"},
)

# Access individual components
vectorstore = results.data_ingest.get_vectorstore()
docs = results.retrieval.invoke("What is RAG?")
answer = results.generation.invoke("What is RAG?")
# Run generation optimization with custom config
builder.optimize_generation(gen_config)

results = builder.optimization_results
response = results.invoke("What is HNSW?")
````

## Component Reference

## Component Options Reference

### Document Loaders
- `pymupdf`: Optimized for PDFs
- `unstructured`: General-purpose loader
- `pymupdf`: Optimized for PDFs
- `pypdf`: Alternative PDF loader
- `bs4`: Web page loader
- `web`: Web page loader
- Custom loaders via `custom_class`

### Chunking Strategies
89 changes: 0 additions & 89 deletions basic_llm_call.ipynb

This file was deleted.

44 changes: 0 additions & 44 deletions call_invoke.ipynb

This file was deleted.

44 changes: 0 additions & 44 deletions demo copy.yaml

This file was deleted.

44 changes: 0 additions & 44 deletions demo.yaml

This file was deleted.

48 changes: 0 additions & 48 deletions demo_2.yaml

This file was deleted.
