自拍偷在线精品自拍偷,亚洲欧美中文日韩v在线观看不卡

AI.x社區(qū)

軟考社區(qū)

免費(fèi)課

企業(yè)培訓(xùn)

鴻蒙開發(fā)者社區(qū)

WOT技術(shù)大會(huì)

公眾號(hào)矩陣

移動(dòng)端

視頻課免費(fèi)課排行榜短視頻直播課軟考學(xué)堂

全部課程軟考華為認(rèn)證廠商認(rèn)證 IT技術(shù)PMP項(xiàng)目管理免費(fèi)題庫

在線學(xué)習(xí)

文章資源問答課堂專欄直播

51CTO

鴻蒙開發(fā)者社區(qū)

51CTO技術(shù)棧

51CTO官微

51CTO學(xué)堂

51CTO博客

CTO訓(xùn)練營

鴻蒙開發(fā)者社區(qū)訂閱號(hào)

51CTO軟考

51CTO學(xué)堂APP

51CTO學(xué)堂企業(yè)版APP

鴻蒙開發(fā)者社區(qū)視頻號(hào)

51CTO軟考題庫

賬號(hào)設(shè)置退出

再見，正則表達(dá)式

作者：寫代碼的明哥 2020-09-04 09:16:04

開發(fā) 前端

從一段指定的字符串中，取得期望的數(shù)據(jù)，正常人都會(huì)想到正則表達(dá)式吧?寫過正則表達(dá)式的人都知道，正則表達(dá)式入門不難，寫起來也容易。

從一段指定的字符串中，取得期望的數(shù)據(jù)，正常人都會(huì)想到正則表達(dá)式吧?

寫過正則表達(dá)式的人都知道，正則表達(dá)式入門不難，寫起來也容易。

但是正則表達(dá)式幾乎沒有可讀性可言，維護(hù)起來，真的會(huì)讓人抓狂，別以為這段正則是你寫的就可以駕馭它，過個(gè)一個(gè)月你可能就不認(rèn)識(shí)它了。

完全可以說，天下苦正則久矣。

今天給你介紹一個(gè)好東西，可以讓你擺脫正則的噩夢，那就是 Python 中一個(gè)非常冷門的庫 -- parse 。

1. 真實(shí)案例

拿一個(gè)最近使用 parse 的真實(shí)案例來舉例說明。

下面是 ovs 一個(gè)條流表，現(xiàn)在我需要收集提取一個(gè)虛擬機(jī)(網(wǎng)口)里有多少流量、多少包流經(jīng)了這條流表。也就是每個(gè) in_port 對應(yīng)的 n_bytes、n_packets 的值。

cookie=0x9816da8e872d717d, duration=298506.364s, table=0, n_packets=480, n_bytes=20160, priority=10,ip,in_port="tapbbdf080b-c2" actions=NORMAL

如果是你，你會(huì)怎么做呢?

先以逗號(hào)分隔開來，再以等號(hào)分隔取出值來?

你不防可以嘗試一下，寫出來的代碼應(yīng)該和我想象的一樣，沒有一絲美感而言。

我來給你展示一下，我是怎么做的?

可以看到，我使用了一個(gè)叫做 parse 的第三方包，是需要自行安裝的

$ python -m pip install parse

從上面這個(gè)案例中，你應(yīng)該能感受到 parse 對于解析規(guī)范的字符串，是非常強(qiáng)大的。

2. parse 的結(jié)果

parse 的結(jié)果只有兩種結(jié)果：

沒有匹配上，parse 的值為None

>>> parse("halo", "hello") is None 
True 
>>>

如果匹配上，parse 的值則為 Result 實(shí)例

>>> parse("hello", "hello world") 
>>> parse("hello", "hello") 
<Result () {}> 
>>>

如果你編寫的解析規(guī)則，沒有為字段定義字段名，也就是匿名字段， Result 將是一個(gè) 類似 list 的實(shí)例，演示如下：

>>> profile = parse("I am {}, {} years old, {}", "I am Jack, 27 years old, male") 
>>> profile 
<Result ('Jack', '27', 'male') {}> 
>>> profile[0] 
'Jack' 
>>> profile[1] 
'27' 
>>> profile[2] 
'male'

而如果你編寫的解析規(guī)則，為字段定義了字段名， Result 將是一個(gè) 類似字典的實(shí)例，演示如下：

>>> profile = parse("I am {name}, {age} years old, {gender}", "I am Jack, 27 years old, male") 
>>> profile 
<Result () {'gender': 'male', 'age': '27', 'name': 'Jack'}> 
>>> profile['name'] 
'Jack' 
>>> profile['age'] 
'27' 
>>> profile['gender'] 
'male'

3. 重復(fù)利用 pattern

和使用 re 一樣，parse 同樣支持 pattern 復(fù)用。

>>> from parse import compile 
>>>  
>>> pattern = compile("I am {}, {} years old, {}") 
>>> pattern.parse("I am Jack, 27 years old, male") 
<Result ('Jack', '27', 'male') {}> 
>>>  
>>> pattern.parse("I am Tom, 26 years old, male") 
<Result ('Tom', '26', 'male') {}>

4. 類型轉(zhuǎn)化

從上面的例子中，你應(yīng)該能注意到，parse 在獲取年齡的時(shí)候，變成了一個(gè)"27" ，這是一個(gè)字符串，有沒有一種辦法，可以在提取的時(shí)候就按照我們的類型進(jìn)行轉(zhuǎn)換呢?

你可以這樣寫。

>>> from parse import parse 
>>> profile = parse("I am {name}, {age:d} years old, {gender}", "I am Jack, 27 years old, male") 
>>> profile 
<Result () {'gender': 'male', 'age': 27, 'name': 'Jack'}> 
>>> type(profile["age"]) 
<type 'int'>

除了將其轉(zhuǎn)為整型，還有其他格式嗎?

內(nèi)置的格式還有很多，比如

匹配時(shí)間

>>> parse('Meet at {:tg}', 'Meet at 1/2/2011 11:00 PM') 
<Result (datetime.datetime(2011, 2, 1, 23, 0),) {}>

更多類型請參考官方文檔：

Type	Characters Matched	Output
l	Letters (ASCII)	str
w	Letters, numbers and underscore	str
W	Not letters, numbers and underscore	str
s	Whitespace	str
S	Non-whitespace	str
d	Digits (effectively integer numbers)	int
D	Non-digit	str
n	Numbers with thousands separators (, or .)	int
%	Percentage (converted to value/100.0)	float
f	Fixed-point numbers	float
F	Decimal numbers	Decimal
e	Floating-point numbers with exponent e.g. 1.1e-10, NAN (all case insensitive)	float
g	General number format (either d, f or e)	float
b	Binary numbers	int
o	Octal numbers	int
x	Hexadecimal numbers (lower and upper case)	int
ti	ISO 8601 format date/time e.g. 1972-01-20T10:21:36Z (“T” and “Z” optional)	datetime
te	RFC2822 e-mail format date/time e.g. Mon, 20 Jan 1972 10:21:36 +1000	datetime
tg	Global (day/month) format date/time e.g. 20/1/1972 10:21:36 AM +1:00	datetime
ta	US (month/day) format date/time e.g. 1/20/1972 10:21:36 PM +10:30	datetime
tc	ctime() format date/time e.g. Sun Sep 16 01:03:52 1973	datetime
th	HTTP log format date/time e.g. 21/Nov/2011:00:07:11 +0000	datetime
ts	Linux system log format date/time e.g. Nov 9 03:37:44	datetime
tt	Time e.g. 10:21:36 PM -5:30	time

5. 提取時(shí)去除空格

去除兩邊空格

>>> parse('hello {} , hello python', 'hello     world    , hello python') 
<Result ('    world   ',) {}> 
>>>  
>>>  
>>> parse('hello {:^} , hello python', 'hello     world    , hello python') 
<Result ('world',) {}>

去除左邊空格

>>> parse('hello {:>} , hello python', 'hello     world    , hello python') 
<Result ('world   ',) {}>

去除右邊空格

>>> parse('hello {:<} , hello python', 'hello     world    , hello python') 
<Result ('    world',) {}>

6. 大小寫敏感開關(guān)

Parse 默認(rèn)是大小寫不敏感的，你寫 hello 和 HELLO 是一樣的。

如果你需要區(qū)分大小寫，那可以加個(gè)參數(shù)，演示如下：

>>> parse('SPAM', 'spam') 
<Result () {}> 
>>> parse('SPAM', 'spam') is None 
False 
>>> parse('SPAM', 'spam', case_sensitive=True) is None 
True

7. 匹配字符數(shù)

精確匹配：指定最大字符數(shù)

>>> parse('{:.2}{:.2}', 'hello')  # 字符數(shù)不符 
>>>  
>>> parse('{:.2}{:.2}', 'hell')   # 字符數(shù)相符 
<Result ('he', 'll') {}>

模糊匹配：指定最小字符數(shù)

>>> parse('{:.2}{:2}', 'hello')  
<Result ('h', 'ello') {}> 
>>>  
>>> parse('{:2}{:2}', 'hello')  
<Result ('he', 'llo') {}>

若要在精準(zhǔn)/模糊匹配的模式下，再進(jìn)行格式轉(zhuǎn)換，可以這樣寫

>>> parse('{:2}{:2}', '1024')  
<Result ('10', '24') {}> 
>>>  
>>>  
>>> parse('{:2d}{:2d}', '1024')  
<Result (10, 24) {}>

8. 三個(gè)重要屬性

Parse 里有三個(gè)非常重要的屬性

fixed：利用位置提取的匿名字段的元組
named：存放有命名的字段的字典
spans：存放匹配到字段的位置

下面這段代碼，帶你了解他們之間有什么不同

>>> profile = parse("I am {name}, {age:d} years old, {}", "I am Jack, 27 years old, male") 
>>> profile.fixed 
('male',) 
>>> profile.named 
{'age': 27, 'name': 'Jack'} 
>>> profile.spans 
{0: (25, 29), 'age': (11, 13), 'name': (5, 9)} 
>>>

9. 自定義類型的轉(zhuǎn)換

匹配到的字符串，會(huì)做為參數(shù)傳入對應(yīng)的函數(shù)

比如我們之前講過的，將字符串轉(zhuǎn)整型

>>> parse("I am {:d}", "I am 27") 
<Result (27,) {}> 
>>> type(_[0]) 
<type 'int'> 
>>>

其等價(jià)于

>>> def myint(string): 
...     return int(string) 
...  
>>>  
>>>  
>>> parse("I am {:myint}", "I am 27", dict(myint=myint)) 
<Result (27,) {}> 
>>> type(_[0]) 
<type 'int'> 
>>>

利用它，我們可以定制很多的功能，比如我想把匹配的字符串弄成全大寫

>>> def shouty(string): 
...    return string.upper() 
... 
>>> parse('{:shouty} world', 'hello world', dict(shouty=shouty)) 
<Result ('HELLO',) {}> 
>>>

10 總結(jié)一下

parse 庫在字符串解析處理場景中提供的便利，肉眼可見，上手簡單。

在一些簡單的場景中，使用 parse 可比使用 re 去寫正則開發(fā)效率不知道高幾個(gè) level，用它寫出來的代碼富有美感，可讀性高，后期維護(hù)起代碼來一點(diǎn)壓力也沒有，推薦你使用。

本文轉(zhuǎn)載自微信公眾號(hào)「 Python編程時(shí)光」，可以通過以下二維碼關(guān)注。轉(zhuǎn)載本文請聯(lián)系 Python編程時(shí)光公眾號(hào)。

責(zé)任編輯：武曉燕來源： Python編程時(shí)光

Python 正則表達(dá)式虛擬機(jī)

點(diǎn)贊

51CTO技術(shù)棧公眾號(hào)

業(yè)務(wù)
速覽

媒體

51CTO CIOAge HC3i

社區(qū)

51CTO博客鴻蒙開發(fā)者社區(qū) AI.x社區(qū)

教育

51CTO學(xué)堂精培企業(yè)培訓(xùn) CTO訓(xùn)練營

<sub id="84rze"></sub>

<cite id="84rze"><rp id="84rze"><pre id="84rze"></pre></rp></cite>

^{<blockquote id="84rze"></blockquote>}

<sup id="84rze"><rt id="84rze"></rt></sup>