自拍偷在线精品自拍偷,亚洲欧美中文日韩v在线观看不卡

<ruby id="sf6i5"></ruby>

<p id="sf6i5"><li id="sf6i5"></li></p>

51CTO首頁(yè)

AI.x社區(qū)

軟考社區(qū)

免費(fèi)課

企業(yè)培訓(xùn)

鴻蒙開發(fā)者社區(qū)

WOT技術(shù)大會(huì)

公眾號(hào)矩陣

移動(dòng)端

視頻課免費(fèi)課排行榜短視頻直播課軟考學(xué)堂

全部課程軟考華為認(rèn)證廠商認(rèn)證 IT技術(shù)PMP項(xiàng)目管理免費(fèi)題庫(kù)

在線學(xué)習(xí)

文章資源問(wèn)答課堂專欄直播

51CTO

鴻蒙開發(fā)者社區(qū)

51CTO技術(shù)棧

51CTO官微

51CTO學(xué)堂

51CTO博客

CTO訓(xùn)練營(yíng)

鴻蒙開發(fā)者社區(qū)訂閱號(hào)

51CTO軟考

51CTO學(xué)堂APP

51CTO學(xué)堂企業(yè)版APP

鴻蒙開發(fā)者社區(qū)視頻號(hào)

51CTO軟考題庫(kù)

AI.x社區(qū)

登錄/注冊(cè)
51CTO

中國(guó)優(yōu)質(zhì)的IT技術(shù)網(wǎng)站

51CTO博客

專業(yè)IT技術(shù)創(chuàng)作平臺(tái)

51CTO學(xué)堂

IT職業(yè)在線教育平臺(tái)

Facebook開源大模型可視分析工具：Transparency Tool ，將Transformer扒的一干二凈原創(chuàng) 精華

發(fā)布于 2024-5-24 11:27

瀏覽

0收藏

Transparency Tool是facebook開源的大語(yǔ)言模型可視分析工具，用于分析基于Transformer架構(gòu)的語(yǔ)言模型。

源碼：https://github.com/facebookresearch/llm-transparency-tool
技術(shù)報(bào)告：https://arxiv.org/pdf/2404.07004.pdf

一、淺談原理

Transformer是由多個(gè)注意力塊堆疊而成，每個(gè)注意力塊視為一層，每個(gè)層包含一個(gè)多頭注意力層和一個(gè)前饋網(wǎng)絡(luò)。token向量在注意力層中，向量之間能夠相互交流，并根據(jù)彼此信息更新自身的值；在前饋網(wǎng)絡(luò)中，向量的值會(huì)被修改。

Facebook開源大模型可視分析工具：Transparency Tool ，將Transformer扒的一干二凈 -AI.x社區(qū)

Transparency Tool將模型的前向推理過(guò)程構(gòu)建成一個(gè)信息流圖，圖的節(jié)點(diǎn)表示token向量，圖的邊表示操作。以次來(lái)追蹤和可視化模型內(nèi)部的信息流動(dòng)路徑，同時(shí)允許檢查單個(gè)注意力頭和神經(jīng)元的貢獻(xiàn)。

Facebook開源大模型可視分析工具：Transparency Tool ，將Transformer扒的一干二凈 -AI.x社區(qū)

二、在線體驗(yàn)

https://huggingface.co/spaces/facebook/llm-transparency-tool-demo

在huggingface上可以體驗(yàn)一下這個(gè)在線demo，在左側(cè)選擇模型，默認(rèn)gpt2，在上方選擇輸入提示文本，默認(rèn)“When Mary and John went to the store, John gave a drink to”。

Facebook開源大模型可視分析工具：Transparency Tool ，將Transformer扒的一干二凈 -AI.x社區(qū)

Graph就是構(gòu)建的信息流圖，在Graph中圓圈代表節(jié)點(diǎn)，線代表邊。點(diǎn)擊token“to”最后一層的節(jié)點(diǎn)，右側(cè)Top Tokens中排在第一個(gè)的是token“Mary”。

Facebook開源大模型可視分析工具：Transparency Tool ，將Transformer扒的一干二凈 -AI.x社區(qū)

同樣，通過(guò)點(diǎn)擊連接節(jié)點(diǎn)的注意力邊，您可以探索形成相關(guān)連接的頭的模式。

Facebook開源大模型可視分析工具：Transparency Tool ，將Transformer扒的一干二凈 -AI.x社區(qū)

Facebook開源大模型可視分析工具：Transparency Tool ，將Transformer扒的一干二凈 -AI.x社區(qū)

如果想了解圖的詳細(xì)構(gòu)建過(guò)程以及如何通過(guò)圖進(jìn)行分析，請(qǐng)參考下面這篇論文。

https://arxiv.org/pdf/2403.00824.pdf

三、私人定制

huggingface只提供了gpt2、distilgpt2、facebook/opt-125m三個(gè)模型，如何加載自己的模型呢？

Transparency Tool是基于TransformerLens開發(fā)的，TransformerLens是一個(gè)專注于生成語(yǔ)言模型（如GPT-2風(fēng)格的模型）的可解釋性的庫(kù)。其核心目標(biāo)是利用訓(xùn)練好的模型，通過(guò)分析模型的內(nèi)部工作機(jī)制，來(lái)提供對(duì)模型行為的深入理解。

https://github.com/neelnanda-io/TransformerLens

所以，凡是TransformerLens支持的模型，Transparency Tool都能支持。

對(duì)于TransformerLens不支持的模型，需要實(shí)現(xiàn)自己的TransparentLlm類。

首先要搭建本地環(huán)境。

Dockerized running

# From the repository root directory
docker build -t llm_transparency_tool .
docker run --rm -p 7860:7860 llm_transparency_tool

Local Installation

# download
git clone git@github.com:facebookresearch/llm-transparency-tool.git
cd llm-transparency-tool


# install the necessary packages
conda env create --name llmtt -f env.yaml
# install the `llm_transparency_tool` package
pip install -e .


# now, we need to build the frontend
# don't worry, even `yarn` comes preinstalled by `env.yaml`
cd llm_transparency_tool/components/frontend
yarn install
yarn build

修改配置文件config/local.json，將模型添加到model中。

{
    "allow_loading_dataset_files": true,
    "preloaded_dataset_filename": "sample_input.txt",
    "debug": true,
    "models": {
        "": null,


        "gpt2": null,
        "distilgpt2": null,
        "facebook/opt-125m": null,
        "facebook/opt-1.3b": null,
        "EleutherAI/gpt-neo-125M": null,
        "Qwen/Qwen-1_8B": null,
        "Qwen/Qwen1.5-0.5B": null,
        "Qwen/Qwen1.5-0.5B-Chat": null,
        "Qwen/Qwen1.5-1.8B": null,
        "Qwen/Qwen1.5-1.8B-Chat": null,
        "microsoft/phi-1": null,
        "microsoft/phi-1_5": null,
        "microsoft/phi-2": null,


        "meta-llama/Llama-2-7b-hf": null,
        "meta-llama/Llama-2-7b-chat-hf": null,


        "meta-llama/Llama-2-13b-hf": null,
        "meta-llama/Llama-2-13b-chat-hf": null,




        "gpt2-medium": null,
        "gpt2-large": null,
        "gpt2-xl": null,


        "mistralai/Mistral-7B-v0.1": null,
        "mistralai/Mistral-7B-Instruct-v0.1": null,
        "mistralai/Mistral-7B-Instruct-v0.2": null,


        "google/gemma-7b": null,
        "google/gemma-2b": null,


        "facebook/opt-2.7b": null,
        "facebook/opt-6.7b": null,
        "facebook/opt-13b": null,
        "facebook/opt-30b": null
    },
    "default_model": "",
    "demo_mode": false
}

啟動(dòng)

streamlit run llm_transparency_tool/server/app.py -- config/local.json

本文轉(zhuǎn)載自公眾號(hào)人工智能大講堂

原文鏈接：??https://mp.weixin.qq.com/s/TSOkh5LEnE0sraE6yGRaCw??

?著作權(quán)歸作者所有，如需轉(zhuǎn)載，請(qǐng)注明出處，否則將追究法律責(zé)任

標(biāo)簽

開源大模型

可視分析工具

贊

收藏

回復(fù)

舉報(bào)

回復(fù)

相關(guān)推薦

激發(fā)大語(yǔ)言模型空間推理能力：思維可視化提示

AIGC最前線 ? 4721瀏覽 ? 0回復(fù)
Google開源大模型新成員CodeGemma、RecurrentGemma，繼Transformer后新架構(gòu)Griffin誕生

AIGC最前線 ? 3403瀏覽 ? 0回復(fù)
LLMCompiler：大模型的并行工具調(diào)用

AIGC最前線 ? 4135瀏覽 ? 0回復(fù)
神器Pandas AI: 一款智能做數(shù)據(jù)分析的工具！

開發(fā)者阿橙 ? 4121瀏覽 ? 0回復(fù)
開源的金融分析工具，Llama3-70B-Instruct模型編織開放的金融智能網(wǎng)

xuxiangda ? 3040瀏覽 ? 0回復(fù)
谷歌推出全新模型，將Transformer與NAR相結(jié)合

Aceryt ? 2275瀏覽 ? 0回復(fù)
【創(chuàng)新一夏學(xué)習(xí)季】熱浪升溫，創(chuàng)新一夏，釋放開發(fā)潛能

AI.x社區(qū)官方賬號(hào) ? 52.8w瀏覽 ? 39回復(fù)
支持大模型流式輸出的JSON提取工具

恰似驚鴻 ? 2968瀏覽 ? 0回復(fù)
一款好用的開源工具，高效實(shí)現(xiàn)Reranker

恰似驚鴻 ? 3379瀏覽 ? 0回復(fù)
聊聊 VMD + CEEMDAN 二次分解，TCN-Transformer并行預(yù)測(cè)模型

Tang_Lan ? 3599瀏覽 ? 0回復(fù)
Pandas AI: 一款可以智能做數(shù)據(jù)分析的工具！

Halo咯咯 ? 3011瀏覽 ? 0回復(fù)
數(shù)據(jù)分析自動(dòng)化：LIDA智能可視化的魔法！

Halo咯咯 ? 2087瀏覽 ? 0回復(fù)
Meta開源“記憶層”，重塑Transformer架構(gòu)大模型

Aceryt ? 1990瀏覽 ? 0回復(fù)
2025年大模型與Transformer架構(gòu)：技術(shù)前沿與未來(lái)趨勢(shì)報(bào)告

歐米伽未來(lái)研究所 ? 6056瀏覽 ? 0回復(fù)
OpenAI將開源 o3-mini，或適合手機(jī)大模型

Aceryt ? 1598瀏覽 ? 0回復(fù)
解鎖Transformer核心！一文吃透自注意力機(jī)制

人工智能訓(xùn)練營(yíng) ? 2962瀏覽 ? 0回復(fù)
扒一扒最近較火的MCP

魯班模錘1 ? 1235瀏覽 ? 0回復(fù)
有一款神器！深入探索Transformer語(yǔ)言模型的可視化工具BertViz

智駐未來(lái) ? 774瀏覽 ? 0回復(fù)
開發(fā)者不寫代碼也能做時(shí)序分析？字節(jié)跳動(dòng) ChatTS 用大模型干掉傳統(tǒng)工具！

凝固的雨_1 ? 699瀏覽 ? 0回復(fù)

這個(gè)用戶很懶，還沒(méi)有個(gè)人簡(jiǎn)介

帖子

聲望

粉絲

關(guān)注

最近發(fā)布

訓(xùn)練大模型時(shí)，顯存都哪去了？ 2024-11-19 12:41:34發(fā)布
生產(chǎn)環(huán)境測(cè)試模型的四種方法 2024-11-15 11:22:05發(fā)布

熱門推薦

大半精銳盡出！o1下線！滿血o3之后，模型本身就是Manus，最大賣點(diǎn)：替代人干真活！ 1回復(fù)

王炸！MCP 架構(gòu)設(shè)計(jì)深度剖析 & 使用 Spring AI + MCP 四步教你實(shí)現(xiàn) Agent 智能體開發(fā) 0回復(fù)

Dify從入門到高階系列二：手把手教學(xué)！超詳細(xì)的Dify知識(shí)庫(kù)配置全攻略 0回復(fù)

Crawl4AI：GitHub榜首40K星標(biāo)！LLM專屬極速開源爬蟲神器 0回復(fù)

只需5分鐘，教你用Python搭建MCP Server 0回復(fù)

下一篇：谷歌多模態(tài)大模型ScreenAI：帶來(lái)人機(jī)界面交互新方式

社區(qū)精華內(nèi)容

目錄

<cite id="wovxv"><track id="wovxv"></track></cite>