Anirud/v1 end to end testing (#70)
* update package.json and package-lock.json
- plus run npm audit fix

* move rag folder into components folder

* remove browser auto-fill on rag input

* correct rag input error message colors

* clear rag input field post submit

* update deploy model functionality:
- add dynamic button text
- disable logic
- refresh handling

* more updates to deploy model
- to show previously selected model and weights

* update models deployed table to refresh model context

* Adds models context to refresh deployed models throughout the app

* add more model interfaces

* Adds Models Provider

* update animated deploy button

* update navbar:
- show chatui icon
    - greyed out when no models deployed
    - shown when one or more models are deployed
- updates the models provider so that other components can subscribe

* update navbar
- adds separator
- updates icon for models deployed

* better error management when deployment fails

* shows user when model is being deleted

* remove model id from toast message

* adds better animation when deploying a model

* Pre select default weights:
- add an animation when the component mounts to show the default weight is selected

* removes finetuner option

* fix: bugs in chatui page and improve layout:
- Restructure chat message layout for better alignment
- Implement max-width and text wrapping for long messages
- Adjust chat bubble styling for consistent appearance
- Enhance ScrollArea to improve spacing between messages
- add tooltip in menu bar
- adjust menu bar look and layout
- improve inference button

* Add better scroll down button when text is very large

* feat(CustomToaster):
- adds an info toast message

* fix ts form errors

* Add log view feature: ListLogsView and GetLogView endpoints

* add log viewer views to test POC

* WIP: Add log view feature: to frontend to test POC

* WIP: log viewer
- update frontend to accept tree file structure from backend

* Enhance LogsViewer icons and add visual indicators

- Add color to folder (yellow) and file (blue) icons
- Introduce ExternalLink icon for log files on hover
- Improve button layout for better visual hierarchy
- Maintain existing functionality and component structure

* fix(logs-viewer): ensure correct log file path handling in frontend + backend
- Removed double-encoding of log file paths when constructing URLs for fetching logs
- Improved log file tree rendering for better user experience when navigating directories

* clean out log viewer feature

* better button effects throughout the app

* fix syntax errors

* Navbar icons get a new button variant

* updates models deployed page for better icons and ux

* fixed named exports in models deployed

* fix import

* update chatui component
- add back scroll wheel
- make it default export

* switch to export default

* update model config: to support llama3-70b
- adds ModelImpl for llama3-70b
- sets WH_ARCH_YAML in case device is N150 or N300x4

* adds temperature and max tokens to the post req to the model

* upstreams new model config and uses instruct weights

* overhaul chat component to accept inference from llama models and show stats

* Append the custom end marker after the last chunk to ensure frontend reads all inference chunks

* updates tsconfig

* adds inference stats component

* updates background

* update to v1.0

* fix ts vite warnings
fixes #61

* remove streaming info for now

* refactor to use:
- only folders that contain logs

* refactor functions to new files with module scope

* attempt to sort logs by date

* added mistral to model config

* removing old implementation in model config

* Anirud/fix refactor chatui page (#64)

* refactor(Header): Resolve ref warning and improve type safety

- Extract ModelSelector as a separate forwardRef component
- Fix TypeScript errors in DropdownMenu implementation
- Improve overall type safety in Header component
- fixes #62

* refactor ChatUI component
- Break down large ChatComponent into smaller, reusable components in its own folder
- Create separate Header, ChatHistory, and InputArea components
- move chat components like stats and examples into their own folder called "chatui"

* refactor: chatui history and examples
- add scroll back which allows user to scroll to bottom of chat

* keep focus on chat text area

* Refactor ChatComponent for improved modularity
- Move utility functions to separate files
- Update imports to use new utility files
- Simplify handleInference function

* increase cycle time before moving to a new chat example

* Create refactored runInference.ts for inference processing

* Create getRagContext.ts for RAG functionality

* Create types.ts with shared interfaces across chat ui

* remove this unused component

---------

Co-authored-by: Mithun Vanniasinghe <[email protected]>
anirudTT and mvanniasingheTT committed Oct 31, 2024
1 parent 678eecd commit d4a7689
Showing 47 changed files with 1,985 additions and 796 deletions.
1 change: 1 addition & 0 deletions app/api/api/settings.py
@@ -56,6 +56,7 @@
"model_control",
"vector_db_control",
"corsheaders",
"logs_control",
]

MIDDLEWARE = [
1 change: 1 addition & 0 deletions app/api/api/urls.py
@@ -30,4 +30,5 @@
path("models/", include("model_control.urls")),
path('reset_board/', include('docker_control.urls')),
path("collections/", include("vector_db_control.urls")),
path("logs/", include("logs_control.urls")),
]
3 changes: 3 additions & 0 deletions app/api/logs_control/__init__.py
@@ -0,0 +1,3 @@
# SPDX-License-Identifier: Apache-2.0
#
# SPDX-FileCopyrightText: © 2024 Tenstorrent AI ULC
10 changes: 10 additions & 0 deletions app/api/logs_control/urls.py
@@ -0,0 +1,10 @@
# SPDX-License-Identifier: Apache-2.0
# SPDX-FileCopyrightText: © 2024 Tenstorrent AI ULC

from django.urls import path
from .views import ListLogsView, GetLogView

urlpatterns = [
path('', ListLogsView.as_view(), name='list_logs'),
path('<path:filename>/', GetLogView.as_view(), name='get_log'), # Use <path:filename> for multi-directory paths
]
81 changes: 81 additions & 0 deletions app/api/logs_control/views.py
@@ -0,0 +1,81 @@
# SPDX-License-Identifier: Apache-2.0
# SPDX-FileCopyrightText: © 2024 Tenstorrent AI ULC

import os
from urllib.parse import unquote
from django.http import JsonResponse, HttpResponse, Http404
from rest_framework.views import APIView
from shared_config.logger_config import get_logger

# Setting up logger
logger = get_logger(__name__)

# Use environment variable for the base storage volume
LOGS_ROOT = os.getenv("INTERNAL_PERSISTENT_STORAGE_VOLUME", "/path/to/fallback")

class ListLogsView(APIView):
"""
Lists all available directories and log files within the base logs directory
"""
def get(self, request, *args, **kwargs):
logger.info("ListLogsView endpoint hit")
try:
logs_tree = self.build_logs_tree(LOGS_ROOT)
logger.info(f"Log tree built: {logs_tree}")
return JsonResponse({'logs': logs_tree}, status=200)
except Exception as e:
logger.error(f"Error listing logs: {e}")
return JsonResponse({'error': str(e)}, status=500)

def build_logs_tree(self, directory):
"""
Recursively build a tree of directories and files.
"""
tree = []
for entry in os.listdir(directory):
path = os.path.join(directory, entry)
if os.path.isdir(path):
tree.append({
"name": entry,
"type": "directory",
"children": self.build_logs_tree(path)
})
elif entry.endswith(".log"):
tree.append({
"name": entry,
"type": "file"
})
return tree
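The recursive tree builder above can be exercised standalone; a minimal sketch (the directory names below are illustrative, and `sorted()` is added here only for deterministic output):

```python
import os
import tempfile

def build_logs_tree(directory):
    """Recursively build a tree of directories and .log files (mirrors the view's helper)."""
    tree = []
    for entry in sorted(os.listdir(directory)):
        path = os.path.join(directory, entry)
        if os.path.isdir(path):
            tree.append({"name": entry, "type": "directory",
                         "children": build_logs_tree(path)})
        elif entry.endswith(".log"):
            tree.append({"name": entry, "type": "file"})
    return tree

with tempfile.TemporaryDirectory() as root:
    os.makedirs(os.path.join(root, "model_a"))
    open(os.path.join(root, "model_a", "run1.log"), "w").close()
    open(os.path.join(root, "readme.txt"), "w").close()  # ignored: not a .log file
    tree = build_logs_tree(root)
# tree == [{'name': 'model_a', 'type': 'directory',
#           'children': [{'name': 'run1.log', 'type': 'file'}]}]
```

Non-`.log` files are silently skipped, so the frontend tree only ever contains directories and log files.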


class GetLogView(APIView):
"""
Retrieves the content of a specific log file from the logs directory.
"""
def get(self, request, filename, *args, **kwargs):

decoded_filename = unquote(filename)

file_path = os.path.normpath(os.path.join(LOGS_ROOT, decoded_filename))

# Security check: Ensure the resolved path is within LOGS_ROOT
if not file_path.startswith(os.path.abspath(LOGS_ROOT)):
logger.error(f"Invalid log file path: {file_path}")
raise Http404("Invalid file path.")

logger.info(f"Looking for log file at: {file_path}")


if os.path.exists(file_path) and os.path.isfile(file_path):
try:
with open(file_path, 'r') as file:
content = file.read()
logger.info(f"Successfully retrieved content for log: {decoded_filename}")
return HttpResponse(content, content_type='text/plain')
except Exception as e:
logger.error(f"Error reading log file {decoded_filename}: {e}")
return JsonResponse({'error': str(e)}, status=500)
else:
logger.error(f"Log file {decoded_filename} not found at {file_path}")
raise Http404(f"Log file {decoded_filename} not found.")
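The containment check above can be sketched as a standalone helper; this version uses `os.path.commonpath` as a stricter variant of a prefix comparison (the function name and paths are illustrative, not part of the commit):

```python
import os

def resolve_log_path(logs_root, filename):
    """Resolve a user-supplied relative path under logs_root, rejecting escapes.

    commonpath is compared against the root so sibling-prefix directories
    (e.g. /logs vs /logs-old) and '..' traversal are both rejected.
    """
    logs_root = os.path.abspath(logs_root)
    candidate = os.path.abspath(os.path.join(logs_root, filename))
    if os.path.commonpath([logs_root, candidate]) != logs_root:
        raise ValueError(f"Invalid log file path: {filename}")
    return candidate

# a path inside the root resolves normally
resolve_log_path("/var/logs", "model_a/run1.log")
# a traversal attempt like "../etc/passwd" raises ValueError
```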

17 changes: 15 additions & 2 deletions app/api/model_control/model_utils.py
@@ -34,21 +34,34 @@ def get_all_records():
data = get_all_records()
return data


def stream_response_from_external_api(url, json_data):
logger.info(f"stream_response_from_external_api to: url={url}")
try:
headers = {"Authorization": f"Bearer {encoded_jwt}"}
logger.info(f"stream_response_from_external_api headers:={headers}")
logger.info(f"stream_response_from_external_api json_data:={json_data}")
json_data["temperature"] = 1
json_data["max_tokens"] = 128
logger.info(f"added extra token and temp!:={json_data}")

with requests.post(
url, json=json_data, headers=headers, stream=True, timeout=None
) as response:
logger.info(f"stream_response_from_external_api response:={response}")
response.raise_for_status()
logger.info(f"response.headers:={response.headers}")
logger.info(f"response.encoding:={response.encoding}")
# only allow HTTP 1.1 chunked encoding
assert response.headers.get("transfer-encoding") == "chunked"
# Note: chunk_size=None must be passed or it will chunk single chars

# Stream chunks
for chunk in response.iter_content(chunk_size=None, decode_unicode=True):
logger.info(f"stream_response_from_external_api chunk:={chunk}")
yield chunk

# Append the custom end marker after the last chunk
yield "<<END_OF_STREAM>>" # Custom marker to signal end of stream

logger.info("stream_response_from_external done")
except requests.RequestException as e:
yield str(e)
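On the consuming side, the frontend reads chunks until it sees the custom marker yielded above; a minimal sketch with a simulated chunk iterator (the helper name is hypothetical, but the marker string matches the one in the diff):

```python
END_MARKER = "<<END_OF_STREAM>>"

def collect_stream(chunks):
    """Accumulate streamed text chunks until the custom end marker appears."""
    parts = []
    for chunk in chunks:
        if END_MARKER in chunk:
            # keep any text that precedes the marker in the final chunk
            parts.append(chunk.split(END_MARKER, 1)[0])
            break
        parts.append(chunk)
    return "".join(parts)

# simulated chunks, as the frontend would receive them
result = collect_stream(["Hello, ", "world", "<<END_OF_STREAM>>"])
# result == "Hello, world"
```

Because the marker is appended as its own chunk after the last content chunk, the reader can stop deterministically instead of waiting for the connection to close.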
35 changes: 31 additions & 4 deletions app/api/shared_config/model_config.py
@@ -25,9 +25,7 @@ class ModelImpl:
model_id: str
image_name: str
image_tag: str
device_configurations: Set[
"DeviceConfigurations"
] # Assuming DeviceConfigurations is an enum or similar
device_configurations: Set["DeviceConfigurations"]
docker_config: Dict[str, Any]
user_uid: int # user inside docker container uid (for file permissions)
user_gid: int # user inside docker container gid (for file permissions)
@@ -41,6 +39,10 @@ def __post_init__(self):
self.docker_config["environment"]["HF_HOME"] = Path(
backend_config.model_container_cache_root
).joinpath("huggingface")

# Set environment variable if N150 or N300x4 is in the device configurations
if DeviceConfigurations.N150 in self.device_configurations or DeviceConfigurations.N300x4 in self.device_configurations:
self.docker_config["environment"]["WH_ARCH_YAML"] = "wormhole_b0_80_arch_eth_dispatch.yaml"
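The device-gated environment logic in `__post_init__` can be sketched in isolation; the enum below is a minimal stand-in for the real `DeviceConfigurations`, and the helper function is illustrative:

```python
from enum import Enum, auto

class DeviceConfigurations(Enum):  # minimal stand-in for the real enum
    N150 = auto()
    N300 = auto()
    N300x4 = auto()

WH_ARCH_YAML = "wormhole_b0_80_arch_eth_dispatch.yaml"

def wh_arch_env(device_configurations):
    """Return the extra environment entries implied by the device set."""
    env = {}
    if (DeviceConfigurations.N150 in device_configurations
            or DeviceConfigurations.N300x4 in device_configurations):
        env["WH_ARCH_YAML"] = WH_ARCH_YAML
    return env

# N300x4 (and N150) trigger the wormhole arch yaml; other devices do not
wh_arch_env({DeviceConfigurations.N300x4})  # {'WH_ARCH_YAML': 'wormhole_b0_80_arch_eth_dispatch.yaml'}
wh_arch_env({DeviceConfigurations.N300})    # {}
```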

@property
def image_version(self) -> str:
@@ -150,9 +152,34 @@ def base_docker_config():
service_port=7000,
service_route="/inference/falcon7b",
),
ModelImpl(
model_name="Llama3.1-70bv0.0.1-instruct",
model_id="id_tt-metal-llama-3.1-70b-instructv0.0.1",
image_name="ghcr.io/tenstorrent/tt-inference-server/tt-metal-llama3-70b-src-base-inference",
image_tag="v0.0.1-tt-metal-v0.52.0-rc31-9d3be887987b",
device_configurations={DeviceConfigurations.N300x4},
docker_config=base_docker_config(),
user_uid=1000,
user_gid=1000,
shm_size="32G",
service_port=7000,
service_route="/inference/llama3-70b",
),
ModelImpl(
model_name="Mistral7B-instruct-v0.2",
model_id="id_tt-metal-mistral-7bv0.0.2",
image_name="ghcr.io/tenstorrent/tt-inference-server/tt-metal-mistral-7b-src-base",
image_tag="v0.0.3-tt-metal-v0.52.0-rc33",
device_configurations={DeviceConfigurations.N300x4},
docker_config=base_docker_config(),
user_uid=1000,
user_gid=1000,
shm_size="32G",
service_port=7000,
service_route="/inference/mistral7b",
)
]


def validate_model_implemenation_config(impl):
# no / in model_id strings, model_id will be used in path names
assert not "/" in impl.model_id
4 changes: 2 additions & 2 deletions app/frontend/index.html
@@ -1,10 +1,10 @@
<!DOCTYPE html>
<!doctype html>
<html lang="en">
<head>
<meta charset="UTF-8" />
<link rel="icon" type="image/svg+xml" href="./src/assets/tt_logo.svg" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<title>llm studio v0.0</title>
<title>llm studio v1.0</title>
</head>
<body>
<div id="root"></div>