Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Perform scraping of webpages without web elements in vectordb #21

Merged
merged 2 commits into from
Aug 27, 2024

Conversation

virajmalia
Copy link
Owner

@virajmalia virajmalia commented Aug 27, 2024

Changes

  1. This PR adds a web crawler that crawls child and related pages through href links on the parent page.
  2. It also scrapes content and cleans up the excess tokens related to webpage elements before creating a vector database.
  3. Cleans up the storage and retrieval process.
INFO:sentence_transformers.SentenceTransformer:Load pretrained SentenceTransformer: sentence-transformers/all-mpnet-base-v2
/home/virajm/.local/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
  warnings.warn(
/home/virajm/.local/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
  warnings.warn(
INFO:sentence_transformers.SentenceTransformer:Use pytorch device_name: cuda
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato, found 391 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-Converse_1908-126, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-127, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-128, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-vgg-129, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-William_Johnston-130, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-VAC-131, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-historyofhasbro-132, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-WTToys-133, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_note-134, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-3, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-4, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-5, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Kew_6-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Ewing_Struik_1992_7-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-8, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Plaisted_9-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-10, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Raker_Spooner_2002_11-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Raker_Spooner_2002_11-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Rodríguez_12-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Rodríguez_12-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-13, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Spooner_2005_14694–99_14-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Spooner_2005_14694–99_14-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Spooner_2005_14694–99_14-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-LostCrops_15-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-LostCrops_15-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-LostCrops_15-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-John_Michael_Francis_2005_16-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-John_Michael_Francis_2005_16-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-John_Michael_Francis_2005_16-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-17, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-18, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-19, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Sauer-2017_20-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Sauer-2017_20-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Sauer-2017_20-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-21, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-PlDis2011_22-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-PlDis2011_22-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-PlDis2011_22-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-23, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-24, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Ames2008_25-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-26, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Genes_27-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Resources_28-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Resources_28-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-UN_Potato_Day_29-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Neofunctionalisation_30-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Neofunctionalisation_30-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Hosaka_Hanneman,_Jr._1998_pp._191–197_31-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-32, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Lindhout_Meijer_Schotte_Hutten_2011_pp._301–312_33-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Strategies_34-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-35, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-36, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-37, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-38, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-39, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-40, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-recipe_tips_41-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-42, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-43, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-44, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Hirsch_45-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-46, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-47, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-48, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-49, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-50, found 0 links
INFO:     127.0.0.1:53676 - "GET /llama4u/c/N4Igxg9gdgZglgcwK4CcCGAjANgUxALlABMIwB9VLAkACwBc6AHAZ3wHo2coA6AdzgDWcRjiJw03CCgRt%2BQtgAUIdNHQggAvhqA/input_schema HTTP/1.1" 200 OK
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Receptor-Mediated_51-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Receptor-Mediated_51-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-52, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-53, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-54, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-55, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-56, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-57, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-nytimes1_58-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-US_Potato_Board_-_Seed_Potatoes_59-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-60, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-61, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-JefferiesLawson1991_62-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-cornell1_63-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-64, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-RHS_planting_65-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-66, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-67, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-68, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-69, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Alyokhin_70-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-71, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Johnson_Auat_Cheein_2023_72-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-73, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-crosstree_74-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-crosstree_74-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-crosstree_74-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-cruk_75-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-epp_76-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-epp_76-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-fao_77-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-78, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-yield2010_79-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-yield2010_79-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-80, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-81, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-82, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-83, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-84, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Ensminger_85-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-3, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-4, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-5, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-6, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-7, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-8, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-supply_86-9, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Zhao2017_87-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-88, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Luck-et-al-2011_89-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-UK_90-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-UK_90-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-91, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-92, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-global_93-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-global_93-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-global_93-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-global_93-3, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Levy_94-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Levy_94-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-95, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-96, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-97, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-98, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-99, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-FDADailyValues_100-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-NationalAcademiesPotassium_101-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Beals_102-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Beals_102-1, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Beals_102-2, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-103, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-gi_104-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-105, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-106, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-fried_107-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Greening_of_potatoes_108-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-boing_109-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-110, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-111, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-112, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-113, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Bremzen90_114-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-115, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-116, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Solomon_117-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-ermochkine_118-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-119, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Halliday_2015_120-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-121, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-122, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-jai_123-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Espinoza_Estrada_Silva-Rodriguez_Tovar_1986_124-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Thurner_2021_125-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-Converse_1908_126-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-127, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-128, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-vgg_129-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-William_Johnston_130-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-VAC_131-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-historyofhasbro_132-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-WTToys_133-0, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato#cite_ref-134, found 0 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato?action=edit, found 1 links
INFO:root:Crawled https://en.wikipedia.org/wiki/Potato?action=edit#bodyContent, found 0 links
INFO:root:Total documents crawled: 181
INFO:root:Creating database with 181 documents.
INFO:root:Creating database with 181 documents.
INFO:chromadb.api.segment:Collection langchain is not created.
WARNING:langchain_core.tracers.base:Parent run acc77fc6-6176-485d-a8ef-bad88e0b1874 not found for run ff3a19da-a247-438e-965f-e846d0ffd55f. Treating as a root run.

The output quality depends heavily on the question itself. The query needs to have some relevancy to the contents of the vector-store, otherwise a very generic summary is obtained, without focus on core content. There may also be a reduced weight on content due to long context overall, so it may help to look at improving context-awareness in the future.
Query: Summarize this page

{
  "generations": [
    [
      {
        "text": "A long list of template names and modules!\n\nIt looks like you've copied the list of templates and modules from Wikipedia, which is a vast collection of reusable code snippets. These templates and modules are used to format and structure content on Wikipedia.\n\nIf you're looking for help summarizing the content of the page about potatoes, I can try to provide a brief summary:\n\nPotatoes (Solanum tuberosum) are a type of root vegetable that is native to the Americas. They are a staple food in many parts of the world and are high in fiber, potassium, and other nutrients. Potatoes are also versatile and can be prepared in many different ways, such as baking, mashing, frying, and roasting.\n\nIs there anything specific you'd like me to summarize or help with?",
        "generation_info": {
          "model": "llama3",
          "created_at": "2024-08-27T00:19:51.526301228Z",
          "message": {
            "role": "assistant",
            "content": ""
          },
          "done": true,
          "total_duration": 24180490574,
          "load_duration": 19605731413,
          "prompt_eval_count": 1842,
          "prompt_eval_duration": 1747013000,
          "eval_count": 161,
          "eval_duration": 2586419000
        },

@virajmalia virajmalia self-assigned this Aug 27, 2024
@virajmalia virajmalia changed the title Perform cleanup of webpage elements in vectordb Perform scraping of webpages without web elements in vectordb Aug 27, 2024
@virajmalia virajmalia merged commit 39f4134 into main Aug 27, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant