NLS Search API Examples

Natural Language Search

Search for documents using natural language queries:

# Basic natural language search
curl -X POST "http://localhost:8000/search" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "What are the safety procedures for chemical handling?",
    "max_results": 5
  }'

# Search with specific provider
curl -X POST "http://localhost:8000/search" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Show me documentation about error handling",
    "provider": "ollama",
    "max_results": 3
  }'

Response:

{
  "results": [
    {
      "id": "doc1",
      "content": "Chemical safety procedures require proper PPE including...",
      "metadata": {
        "category": "safety",
        "department": "laboratory"
      },
      "score": 0.89
    }
  ]
}

Document Indexing

Index Single Document

curl -X POST "http://localhost:8000/index" \
  -H "Content-Type: application/json" \
  -d '{
    "id": "unique_doc_id",
    "content": "This is the document content that will be searchable using natural language queries.",
    "metadata": {
      "category": "documentation",
      "author": "John Doe",
      "date": "2024-03-03"
    }
  }'

Response:

{
  "success": true
}

Bulk Index from MongoDB

Index multiple documents from a MongoDB collection:

curl -X POST "http://localhost:8000/bulk-index" \
  -H "Content-Type: application/json" \
  -d '{
    "collection_name": "documents",
    "aggregation_pipeline": [
      {"$match": {"status": "active"}},
      {"$limit": 1000}
    ],
    "id_field": "_id",
    "content_field": "text",
    "metadata_fields": ["category", "author", "tags"],
    "batch_size": 100
  }'

Response:

{
  "indexed_count": 150,
  "error_count": 0,
  "elapsed_time": 25.5,
  "rate": 5.88,
  "errors": []
}

Delete Document

Delete a document by its ID:

curl -X DELETE "http://localhost:8000/documents/{document_id}" \
  -H "Content-Type: application/json"

Response:

{
  "success": true
}

Update Document

Update an existing document:

curl -X PUT "http://localhost:8000/documents/{document_id}" \
  -H "Content-Type: application/json" \
  -d '{
    "content": "Updated document content that will be re-embedded and searchable.",
    "metadata": {
      "category": "documentation",
      "author": "Jane Doe",
      "last_updated": "2024-03-04"
    }
  }'

Response:

{
  "success": true
}

Note: When updating a document, the system will:

Generate new embeddings for the updated content
Replace the old document's embeddings and metadata
Maintain the same document ID

Natural Language Query Examples

Here are some example queries that demonstrate the semantic search capabilities:

# Conceptual search
curl -X POST "http://localhost:8000/search" \
  -d '{
    "text": "What are best practices for data security?",
    "max_results": 5
  }'

# Question-based search
curl -X POST "http://localhost:8000/search" \
  -d '{
    "text": "How do I handle customer complaints?",
    "max_results": 3
  }'

# Topic exploration
curl -X POST "http://localhost:8000/search" \
  -d '{
    "text": "Tell me about project management methodologies",
    "max_results": 5
  }'

The system will understand these queries semantically and return relevant documents, even if they don't contain the exact words but cover the same concepts.

Configuration Examples

config.yaml

vector_db:
  type: qdrant
  vector_size: 384  # Matches Ollama's all-minilm model

providers:
  ollama:
    enabled: true
    url: ${OLLAMA_URL}
    model: ${OLLAMA_MODEL}
    embedding_model: ${OLLAMA_EMBEDDING_MODEL}
    vector_size: 384

search:
  default_provider: ${DEFAULT_PROVIDER}
  max_results: 10
  similarity_threshold: 0.3  # Lower values = more results but potentially less relevant

Environment Variables (.env)

# API Settings
APP_HOST=0.0.0.0
APP_PORT=8000
DEBUG=true

# Vector DB
QDRANT_HOST=localhost
QDRANT_PORT=6333
QDRANT_COLLECTION=documents

# Provider Settings
DEFAULT_PROVIDER=ollama
OLLAMA_URL=http://localhost:11434
OLLAMA_MODEL=llama2
OLLAMA_EMBEDDING_MODEL=all-minilm

Error Handling

The API returns clear error messages:

{
  "detail": "Vector size mismatch: Provider 'ollama' generated embedding of size 384, but vector DB expects 768"
}

{
  "detail": "Search query cannot be empty"
}

Best Practices

Natural Language Queries
- Use complete sentences or questions
- Be specific but natural in your queries
- Include relevant context
Document Indexing
- Provide meaningful content for embedding
- Include relevant metadata
- Use batch processing for large datasets
Configuration
- Match vector sizes between provider and database
- Adjust similarity threshold based on needs
- Monitor and tune based on results

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EXAMPLES.md

EXAMPLES.md

NLS Search API Examples

Natural Language Search

Document Indexing

Index Single Document

Bulk Index from MongoDB

Delete Document

Update Document

Natural Language Query Examples

Configuration Examples

config.yaml

Environment Variables (.env)

Error Handling

Best Practices

Files

EXAMPLES.md

Latest commit

History

EXAMPLES.md

File metadata and controls

NLS Search API Examples

Natural Language Search

Document Indexing

Index Single Document

Bulk Index from MongoDB

Delete Document

Update Document

Natural Language Query Examples

Configuration Examples

config.yaml

Environment Variables (.env)

Error Handling

Best Practices