Implementing Backpropagation in a Neural Network
Building a neural network involves several steps, and two of the most important are implementing forward and backward propagation. These two terms sound daunting and often intimidate beginners, but if you break each technique down into its individual steps, they become straightforward to understand. In this article we focus on backpropagation and the intuition behind each of its steps.
What is backpropagation?
It is simply a technique used when implementing a neural network that lets us compute the gradients of the parameters, so that we can run gradient descent and minimize the cost function. Many describe backpropagation as the most mathematically intensive part of a neural network. Relax, though: in this article we will demystify every part of it.
Implementing backpropagation
Assume a simple two-layer neural network: one hidden layer and one output layer. We can perform backpropagation as follows. Initialize the weights and biases to be used by the neural network: this involves randomly initializing the network's weights and biases. The gradients of these parameters will be obtained from backpropagation and used to update the parameters during gradient descent.
#Import the NumPy library
import numpy as np
#Set a seed for reproducibility
np.random.seed(100)
#We will first initialize the weights and biases needed and store them in a dictionary called W_B
def initialize(num_f, num_h, num_out):
    '''
    Description: This function randomly initializes the weights and biases of each layer of the neural network
    Input Arguments:
    num_f - the number of training features
    num_h - the number of nodes in the hidden layer
    num_out - the number of nodes in the output layer
    Output:
    W_B - a dictionary of the initialized parameters
    '''
    #Randomly initialize the weights, set the biases to zero, and store them in a dictionary
    W_B = {
        'W1': np.random.randn(num_h, num_f),
        'b1': np.zeros((num_h, 1)),
        'W2': np.random.randn(num_out, num_h),
        'b2': np.zeros((num_out, 1))
    }
    return W_B
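As a quick illustration (the dimensions below are assumed for this example, not taken from the article), we could initialize a network with 4 input features, 3 hidden nodes and 1 output node, and inspect the parameter shapes:
#Illustrative usage with assumed dimensions
W_B = initialize(num_f=4, num_h=3, num_out=1)
print(W_B['W1'].shape)  #(3, 4)
print(W_B['b1'].shape)  #(3, 1)
print(W_B['W2'].shape)  #(1, 3)
print(W_B['b2'].shape)  #(1, 1)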
Perform forward propagation: this involves computing the linear and activation outputs of the hidden layer and the output layer.
For the hidden layer, we will use the ReLU activation function, shown below:
#We will now proceed to create functions for each of our activation functions
def relu(Z):
    '''
    Description: This function applies the ReLU activation function to a given number or matrix.
    Input Arguments:
    Z - matrix or integer
    Output:
    relu_Z - matrix or integer with ReLU applied to it
    '''
    relu_Z = np.maximum(Z, 0)
    return relu_Z
For the output layer, we will use the sigmoid activation function, shown below:
def sigmoid(Z):
    '''
    Description: This function applies the sigmoid activation function to a given number or matrix.
    Input Arguments:
    Z - matrix or integer
    Output:
    sigmoid_Z - matrix or integer with sigmoid applied to it
    '''
    sigmoid_Z = 1 / (1 + np.exp(-Z))
    return sigmoid_Z
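As a quick sanity check (the sample values below are invented for illustration), we can apply both activation functions to a small array:
#Sanity check with made-up values
Z_test = np.array([[-2.0, 0.0, 3.0]])
print(relu(Z_test))     #[[0. 0. 3.]]
print(sigmoid(Z_test))  #approximately [[0.119 0.5 0.953]]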
Perform forward propagation:
#We will now proceed to perform forward propagation
def forward_propagation(X, W_B):
    '''
    Description: This function performs forward propagation in a vectorized form
    Input Arguments:
    X - input training examples, of shape (num_f, no_examples), i.e. one example per column
    W_B - initialized weights and biases
    Output:
    forward_results - a dictionary containing the linear and activation outputs
    '''
    #Calculate the linear output Z for the hidden layer
    Z1 = np.dot(W_B['W1'], X) + W_B['b1']
    #Calculate the activation output for the hidden layer
    A = relu(Z1)
    #Calculate the linear output Z for the output layer
    Z2 = np.dot(W_B['W2'], A) + W_B['b2']
    #Calculate the activation output for the output layer
    Y_pred = sigmoid(Z2)
    #Save everything in a dictionary
    forward_results = {"Z1": Z1,
                       "A": A,
                       "Z2": Z2,
                       "Y_pred": Y_pred}
    return forward_results
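To see how these pieces fit together, here is a short illustrative call. The shapes below (4 features, 3 hidden nodes, 1 output node, 5 training examples stored as columns) are assumptions made up for this example, not values from the article:
#Illustrative usage with assumed shapes: training examples are stored as the columns of X
X = np.random.randn(4, 5)
W_B = initialize(num_f=4, num_h=3, num_out=1)
forward_results = forward_propagation(X, W_B)
print(forward_results['Y_pred'].shape)  #(1, 5): one prediction per training example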
Perform backward propagation: compute the gradients of the cost with respect to the parameters involved in gradient descent. In this case these are dLdZ2, dLdW2, dLdb2, dLdZ1, dLdW1 and dLdb1. These gradients will be combined with the learning rate to perform gradient descent. We will implement a vectorized version of backpropagation over many training examples (no_examples).
The step-by-step guide is as follows:
- Obtain the results from the forward pass, as shown below:
forward_results = forward_propagation(X, W_B)
Z1 = forward_results['Z1']
A = forward_results['A']
Z2 = forward_results['Z2']
Y_pred = forward_results['Y_pred']
- Obtain the number of training examples, as shown below:
no_examples = X.shape[1]
- Compute the loss (binary cross-entropy):
L = (1/no_examples) * np.sum(-Y_true * np.log(Y_pred) - (1 - Y_true) * np.log(1 - Y_pred))
- Compute the gradient of each parameter, as shown below:
dLdZ2 = Y_pred - Y_true
dLdW2 = (1/no_examples) * np.dot(dLdZ2, A.T)
dLdb2 = (1/no_examples) * np.sum(dLdZ2, axis=1, keepdims=True)
#The hidden layer uses ReLU, so its derivative is 1 where Z1 > 0 and 0 elsewhere
dLdZ1 = np.multiply(np.dot(W_B['W2'].T, dLdZ2), (Z1 > 0).astype(float))
dLdW1 = (1/no_examples) * np.dot(dLdZ1, X.T)
dLdb1 = (1/no_examples) * np.sum(dLdZ1, axis=1, keepdims=True)
- Store the computed gradients needed for gradient descent in a dictionary:
gradients = {"dLdW1": dLdW1,
"dLdb1": dLdb1,
"dLdW2": dLdW2,
"dLdb2": dLdb2}
- Return the loss and the stored gradients:
return gradients, L
Here is the complete backward propagation function:
def backward_propagation(X, W_B, Y_true):
    '''
    Description: This function performs backward propagation in a vectorized form
    Input Arguments:
    X - input training examples
    W_B - initialized weights and biases
    Y_true - the true target values of the training examples
    Output:
    gradients - the calculated gradients of each parameter
    L - the loss
    '''
    #Obtain the results from forward propagation
    forward_results = forward_propagation(X, W_B)
    Z1 = forward_results['Z1']
    A = forward_results['A']
    Z2 = forward_results['Z2']
    Y_pred = forward_results['Y_pred']
    #Obtain the number of training examples
    no_examples = X.shape[1]
    #Calculate the loss (binary cross-entropy)
    L = (1/no_examples) * np.sum(-Y_true * np.log(Y_pred) - (1 - Y_true) * np.log(1 - Y_pred))
    #Calculate the gradient of each parameter needed for gradient descent
    dLdZ2 = Y_pred - Y_true
    dLdW2 = (1/no_examples) * np.dot(dLdZ2, A.T)
    dLdb2 = (1/no_examples) * np.sum(dLdZ2, axis=1, keepdims=True)
    #The hidden layer uses ReLU, so its derivative is 1 where Z1 > 0 and 0 elsewhere
    dLdZ1 = np.multiply(np.dot(W_B['W2'].T, dLdZ2), (Z1 > 0).astype(float))
    dLdW1 = (1/no_examples) * np.dot(dLdZ1, X.T)
    dLdb1 = (1/no_examples) * np.sum(dLdZ1, axis=1, keepdims=True)
    #Store the gradients needed for gradient descent in a dictionary
    gradients = {"dLdW1": dLdW1,
                 "dLdb1": dLdb1,
                 "dLdW2": dLdW2,
                 "dLdb2": dLdb2}
    return gradients, L
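The article stops at computing the gradients, but since they are meant to be combined with a learning rate for gradient descent, here is a minimal sketch of that update step. The helper update_parameters and the learning_rate value are assumptions for illustration, not part of the original code:
#Minimal sketch of a gradient descent update (update_parameters is an assumed helper, not from the article)
def update_parameters(W_B, gradients, learning_rate=0.01):
    W_B['W1'] = W_B['W1'] - learning_rate * gradients['dLdW1']
    W_B['b1'] = W_B['b1'] - learning_rate * gradients['dLdb1']
    W_B['W2'] = W_B['W2'] - learning_rate * gradients['dLdW2']
    W_B['b2'] = W_B['b2'] - learning_rate * gradients['dLdb2']
    return W_B
#A hypothetical training loop would then alternate the two functions:
#for i in range(num_iterations):
#    gradients, L = backward_propagation(X, W_B, Y_true)
#    W_B = update_parameters(W_B, gradients, learning_rate=0.01)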
Many people assume backpropagation is difficult, but as this article has shown, that is not really the case. Master each individual step and you have mastered the whole technique. It also helps to have a grasp of mathematics such as linear algebra and calculus, so you understand how the individual gradients of each function are computed. With these tools, backpropagation should be a piece of cake! In practice, backpropagation is usually handled by the deep learning framework you use. However, it is worth understanding how the technique works under the hood, because it can sometimes help us understand why a neural network is not training well.
This article is reproduced from 小白遇見(jiàn)AI; author: 小煩.
