男女激情操逼一区福利网站,国产又黄又爽的免费视频,国产黑料视频你懂的,久久国产小视频,在线无码中文字幕,日韩无码19,可以免费看的黄色视频网站,三级日本黄色电影在线观看

RikoPython 流處理引擎

聯(lián)合創(chuàng)作 · 2023-09-30 05:51

Riko是一款Python 流處理引擎，類似Yahoo Pipes。采用純python開發(fā)，用于分析處理結(jié)構(gòu)化數(shù)據(jù)流。擁有同步和異步APIs，同時也支持并行RSS feeds。Riko也支持字符終端界面。

功能特性：

可讀取csv/xml/json/html文件。
通過模塊化的管道可創(chuàng)建文本流和數(shù)據(jù)流。
可解析、處理、提取RSS/Atom feeds。
可創(chuàng)建強大的混合型APIs和maps。
支持并行處理。

使用示例代碼：

>>> ### Create a SyncPipe flow ###
>>> #
>>> # `SyncPipe` is a convenience class that creates chainable flows
>>> # and allows for parallel processing.
>>> from riko.collections.sync import SyncPipe
>>>
>>> ### Set the pipe configurations ###
>>> #
>>> # Notes:
>>> #   1. the `detag` option will strip all html tags from the result
>>> #   2. fetch the text contained inside the 'body' tag of the hackernews
>>> #      homepage
>>> #   3. replace newlines with spaces and assign the result to 'content'
>>> #   4. tokenize the resulting text using whitespace as the delimeter
>>> #   5. count the number of times each token appears
>>> #   6. obtain the raw stream
>>> #   7. extract the first word and its count
>>> #   8. extract the second word and its count
>>> #   9. extract the third word and its count
>>> url = 'https://news.ycombinator.com/'
>>> fetch_conf = {
...     'url': url, 'start': '<body>', 'end': '</body>', 'detag': True}  # 1
>>>
>>> replace_conf = {
...     'rule': [
...         {'find': '\r\n', 'replace': ' '},
...         {'find': '\n', 'replace': ' '}]}
>>>
>>> flow = (
...     SyncPipe('fetchpage', conf=fetch_conf)                           # 2
...         .strreplace(conf=replace_conf, assign='content')             # 3
...         .stringtokenizer(conf={'delimiter': ' '}, emit=True)         # 4
...         .count(conf={'count_key': 'content'}))                       # 5
>>>
>>> stream = flow.output                                                 # 6
>>> next(stream)                                                         # 7
{"'sad": 1}
>>> next(stream)                                                         # 8
{'(': 28}
>>> next(stream)                                                         # 9
{'(1999)': 1}

點贊

評論

編輯分享

舉報