
Auto-Retrieval: The Intelligent Evolution of RAG

Published on 2024-10-23 10:21

Auto-Retrieval is an advanced RAG technique. Where naive RAG sends the user query straight to the vector database's retrieval interface (e.g., dense vector search), Auto-Retrieval first uses an agent LLM to dynamically infer metadata filter parameters and a semantic query before kicking off vector database retrieval. You can view it as a form of query expansion/rewriting, or as a specific form of function calling; the implementation logic and code follow below. The effect looks like this:

User input

Give me a summary of the SWE-bench paper

Inference result

Rewritten query: summary of the SWE-bench paper
Filter parameters: {"filters": [{"key": "file_name", "value": "swebench.pdf", "operator": "=="}], "condition": "and"}
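
For reference, the inferred filter JSON above maps one-to-one onto LlamaIndex's filter objects. A minimal sketch of that deserialization (the key, value, and operator are taken from the example output above):

from llama_index.core.vector_stores.types import (
    FilterCondition,
    FilterOperator,
    MetadataFilter,
    MetadataFilters,
)

# {"filters": [{"key": "file_name", "value": "swebench.pdf", "operator": "=="}],
#  "condition": "and"} deserializes into:
filters = MetadataFilters(
    filters=[
        MetadataFilter(
            key="file_name", value="swebench.pdf", operator=FilterOperator.EQ
        )
    ],
    condition=FilterCondition.AND,
)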

Implementation steps

We implement this with LlamaCloud by layering an Auto-Retrieval function on top of a LlamaCloud retriever. At a high level, our auto-retrieval function uses a function-calling LLM to infer metadata filters for the user query, which yields more precise and relevant retrieval results than using the raw semantic query alone. The steps are:

  • Define a custom prompt for generating metadata filters.
  • Given a user query, first run chunk-level retrieval to dynamically pull metadata from the retrieved chunks.
  • Inject that metadata into the auto-retrieval prompt as few-shot examples, showing the LLM existing, relevant metadata values so it can infer the correct metadata filters.

A document-level retriever returns whole files, while a chunk-level retriever returns individual chunks; both are simple to set up:

from llama_index.indices.managed.llama_cloud import LlamaCloudIndex
import os


index = LlamaCloudIndex(
  name="research_papers_page",
  project_name="llamacloud_demo",
  api_key=os.environ["LLAMA_CLOUD_API_KEY"]
)


# Document-level retriever: returns whole files as context
doc_retriever = index.as_retriever(
    retrieval_mode="files_via_content",
    # retrieval_mode="files_via_metadata",
    files_top_k=1
)


# Chunk-level retriever: returns the top reranked chunks
chunk_retriever = index.as_retriever(
    retrieval_mode="chunks",
    rerank_top_n=5
)
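
As a quick sanity check, either retriever can be called directly. A small sketch (the query string here is just an illustrative example):

# Fetch reranked chunks and peek at their metadata
nodes = chunk_retriever.retrieve("How was the SWE-bench dataset constructed?")
for n in nodes:
    print(n.node.metadata.get("file_name"), "-", n.node.get_content()[:80])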

Code implementation

Next, we implement the flow described above:

from llama_index.core.prompts import ChatPromptTemplate
from llama_index.core.vector_stores.types import VectorStoreInfo, VectorStoreQuerySpec, MetadataInfo, MetadataFilters
from llama_index.core.retrievers import BaseRetriever
from llama_index.core.query_engine import RetrieverQueryEngine
from llama_index.core import Response


import json

# NOTE: the original post never shows how `llm` is constructed; we assume an
# OpenAI model via LlamaIndex here (any function-calling LLM should work).
from llama_index.llms.openai import OpenAI

llm = OpenAI(model="gpt-4o")


SYS_PROMPT = """\
Your goal is to structure the user's query to match the request schema provided below.
You MUST call the tool in order to generate the query spec.


<< Structured Request Schema >>
When responding use a markdown code snippet with a JSON object formatted in the \
following schema:


{schema_str}


The query string should contain only text that is expected to match the contents of \
documents. Any conditions in the filter should not be mentioned in the query as well.


Make sure that filters only refer to attributes that exist in the data source.
Make sure that filters take into account the descriptions of attributes.
Make sure that filters are only used as needed. If there are no filters that should be \
applied return [] for the filter value.\


If the user's query explicitly mentions number of documents to retrieve, set top_k to \
that number, otherwise do not set top_k.


The schema of the metadata filters in the vector db table is listed below, along with some example metadata dictionaries from relevant rows.
The user will send the input query string.


Data Source:
```json
{info_str}
```


Example metadata from relevant chunks:
{example_rows}


"""


example_rows_retriever = index.as_retriever(
    retrieval_mode="chunks",
    rerank_top_n=4
)


def get_example_rows_fn(**kwargs):
    """Retrieve relevant few-shot examples."""
    query_str = kwargs["query_str"]
    nodes = example_rows_retriever.retrieve(query_str)
    # get the metadata, join them
    metadata_list = [n.metadata for n in nodes]


    return "\n".join([json.dumps(m) for m in metadata_list])


# Bind the prompt template; `example_rows` is resolved at runtime via the
# function mapping below.
chat_prompt_tmpl = ChatPromptTemplate.from_messages(
    [
        ("system", SYS_PROMPT),
        ("user", "{query_str}"),
    ],
    function_mappings={
        "example_rows": get_example_rows_fn
    }
)
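
Note the `function_mappings` argument: when the template is formatted, LlamaIndex resolves the `{example_rows}` variable by calling `get_example_rows_fn` with the keyword arguments passed to the prompt, so the few-shot metadata examples are fetched dynamically for each query.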




## NOTE: this is a dataclass that contains information about the metadata
vector_store_info = VectorStoreInfo(
    content_info="contains content from various research papers",
    metadata_info=[
        MetadataInfo(
            name="file_name",
            type="str",
            description="Name of the source paper",
        ),
    ],
)
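
To see exactly what the LLM receives for the `{info_str}` and `{schema_str}` template variables used below, you can print the serialized forms directly:

# Inspect the strings injected into the prompt
print(vector_store_info.json(indent=4))
print(VectorStoreQuerySpec.schema_json(indent=4))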


def auto_retriever_rag(query: str, retriever: BaseRetriever) -> Response:
    """Synthesizes an answer to your question by feeding in an entire relevant document as context."""
    print(f"> User query string: {query}")
    # Use structured predict to infer the metadata filters and query string.
    query_spec = llm.structured_predict(
        VectorStoreQuerySpec,
        chat_prompt_tmpl,
        info_str=vector_store_info.json(indent=4),
        schema_str=VectorStoreQuerySpec.schema_json(indent=4),
        query_str=query
    )
    # build retriever and query engine
    filters = MetadataFilters(filters=query_spec.filters) if len(query_spec.filters) > 0 else None
    print(f"> Inferred query string: {query_spec.query}")
    if filters:
        print(f"> Inferred filters: {filters.json()}")
    query_engine = RetrieverQueryEngine.from_args(
        retriever, 
        llm=llm,
        response_mode="tree_summarize"
    )
    # run query
    return query_engine.query(query_spec.query)

Results

response = auto_retriever_rag(
    "Give me a summary of the SWE-bench paper", doc_retriever
)
print(str(response))

> User query string: Give me a summary of the SWE-bench paper
> Inferred query string: summary of the SWE-bench paper
> Inferred filters: {"filters": [{"key": "file_name", "value": "swebench.pdf", "operator": "=="}], "condition": "and"}
The construction of SWE-Bench involves a three-stage pipeline:


1. **Repo Selection and Data Scraping**: Pull requests (PRs) are collected from 12 popular open-source Python repositories on GitHub, resulting in approximately 90,000 PRs. These repositories are chosen for their popularity, better maintenance, clear contributor guidelines, and extensive test coverage.


2. **Attribute-Based Filtering**: Candidate tasks are created by selecting merged PRs that resolve a GitHub issue and make changes to the test files of the repository. This indicates that the user likely contributed tests to check whether the issue has been resolved.


3. **Execution-Based Filtering**: For each candidate task, the PR’s test content is applied, and the associated test results are logged before and after the PR’s other content is applied. Tasks are filtered out if they do not have at least one test where its status changes from fail to pass or if they result in installation or runtime errors.


Through these stages, the original 90,000 PRs are filtered down to 2,294 task instances that comprise SWE-Bench.


This article is reposted from the WeChat public account 哎呀AIYA.

Original link: https://mp.weixin.qq.com/s/wcmJ3OQzDxx_ILo_m7zA2Q
