【Machine Learning】Random Forest Is My Favorite Model
TensorFlow Decision Forests (TF-DF) is now open source. The library bundles many state-of-the-art (SOTA) algorithms and requires no input-feature preprocessing: it handles numeric and categorical features natively, saving developers a great deal of time.





For beginners, decision forest models are easier to develop and interpret. You do not need to explicitly list or preprocess input features (decision forests naturally handle numeric and categorical attributes), specify an architecture (for example, by trying different layer combinations, as with a neural network), or worry about the model diverging. Once your model is trained, you can plot it directly or analyze it with easily interpretable statistics.
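The reason decision forests need no feature preprocessing can be seen in a pure-Python sketch of a single CART-style split search (illustrative only, not TF-DF code; all function names here are hypothetical): a split condition operates directly on raw values, using a threshold for numeric features and an equality test for categorical ones.

```python
def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())

def best_split(rows, labels):
    """Scan every feature for the split minimizing weighted impurity.
    Numeric features use a threshold test; categorical ones use equality,
    so no scaling or one-hot encoding is needed."""
    best = None
    for i in range(len(rows[0])):
        for v in {r[i] for r in rows}:
            if isinstance(v, (int, float)):
                mask = [r[i] <= v for r in rows]   # threshold split
            else:
                mask = [r[i] == v for r in rows]   # categorical split
            left = [y for y, m in zip(labels, mask) if m]
            right = [y for y, m in zip(labels, mask) if not m]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if best is None or score < best[0]:
                best = (score, i, v)
    return best  # (weighted impurity, feature index, split value)

# Mixed raw features: (bill_length_mm, island) -- no preprocessing applied
rows = [(39.1, "Torgersen"), (46.5, "Dream"), (49.9, "Biscoe"), (38.6, "Torgersen")]
labels = ["Adelie", "Chinstrap", "Gentoo", "Adelie"]
print(best_split(rows, labels))
```

A real learner recurses on each side of the chosen split; the point is that raw numeric and string values are consumed as-is.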
Advanced users will benefit from models with very fast inference (sub-microsecond per example in many cases). The library also offers a great deal of composability for model experimentation and research; in particular, it is easy to combine neural networks with decision forests.

TF-DF provides a collection of SOTA decision forest training and serving algorithms, such as random forests, CART, (Lambda)MART, and DART.
Tree-based models integrate easily with the various TensorFlow tools, libraries, and platforms (such as TFX), so the TF-DF library can serve as a bridge into the rich TensorFlow ecosystem.
For neural network users, decision forests are a simple way to get started with TensorFlow, from which you can go on to explore neural networks.

Project repository: https://github.com/tensorflow/decision-forests
TF-DF website: https://www.tensorflow.org/decision_forests
Google I/O 2021 talk: https://www.youtube.com/watch?v=5qgk9QJ4rdQ

Training a model takes only a few lines of code:

# Install TensorFlow Decision Forests
!pip install tensorflow_decision_forests

# Load TensorFlow Decision Forests
import tensorflow_decision_forests as tfdf

# Load the training dataset using pandas
import pandas
train_df = pandas.read_csv("penguins_train.csv")

# Convert the pandas dataframe into a TensorFlow dataset
train_ds = tfdf.keras.pd_dataframe_to_tf_dataset(train_df, label="species")

# Train the model
model = tfdf.keras.RandomForestModel()
model.fit(train_ds)
Evaluating the model and exporting it as a SavedModel are just as simple:

# Load the testing dataset
test_df = pandas.read_csv("penguins_test.csv")

# Convert it to a TensorFlow dataset
test_ds = tfdf.keras.pd_dataframe_to_tf_dataset(test_df, label="species")

# Evaluate the model
model.compile(metrics=["accuracy"])
print(model.evaluate(test_ds))
# >> 0.979311
# Note: Cross-validation would be more suited on this small dataset.
# See also the "Out-of-bag evaluation" below.

# Export the model to a TensorFlow SavedModel
model.save("project/my_first_model")
You can plot an individual tree of the trained model directly in a Colab notebook:

tfdf.model_plotter.plot_model_in_colab(model, tree_idx=0)
Beyond plots, model statistics answer questions such as:
How many times is each feature used?
How fast did the model train (number of trees and wall time)?
How are nodes distributed across the tree structure (for instance, the length of most branches)?
# Print all the available information about the model
model.summary()
>> Input Features (7):
>>   bill_depth_mm
>>   bill_length_mm
>>   body_mass_g
>>   ...
>> Variable Importance:
>>   1. "bill_length_mm" 653.000000 ################
>>   ...
>> Out-of-bag evaluation: accuracy:0.964602 logloss:0.102378
>> Number of trees: 300
>> Total number of nodes: 4170
>> ...

# Get feature importance as an array
model.make_inspector().variable_importances()["MEAN_DECREASE_IN_ACCURACY"]
>> [("flipper_length_mm", 0.149),
>>  ("bill_length_mm", 0.096),
>>  ("bill_depth_mm", 0.025),
>>  ("body_mass_g", 0.018),
>>  ("island", 0.012)]
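The "out-of-bag evaluation" in the summary falls out of bagging itself: each tree is trained on a bootstrap sample, and the roughly one third of rows a tree never saw act as a free validation set for it. The following is a pure-Python sketch of that bookkeeping, not TF-DF code; each "tree" is a trivial stand-in that predicts its bootstrap sample's majority label.

```python
import random

def oob_accuracy(labels, n_trees=200, seed=0):
    """Estimate accuracy from out-of-bag votes, as in bagged ensembles."""
    rng = random.Random(seed)
    n = len(labels)
    votes = [{} for _ in range(n)]          # out-of-bag votes per example
    for _ in range(n_trees):
        bag = [rng.randrange(n) for _ in range(n)]   # bootstrap sample
        in_bag = set(bag)
        # Stand-in "tree": predicts the majority label of its bootstrap sample.
        counts = {}
        for i in bag:
            counts[labels[i]] = counts.get(labels[i], 0) + 1
        majority = max(counts, key=counts.get)
        for i in range(n):
            if i not in in_bag:             # vote only on unseen examples
                votes[i][majority] = votes[i].get(majority, 0) + 1
    correct = sum(1 for i in range(n)
                  if votes[i] and max(votes[i], key=votes[i].get) == labels[i])
    return correct / n

print(oob_accuracy(["a"] * 7 + ["b"] * 3))
```

Because every example is scored only by trees that never trained on it, the estimate comes at no extra cost, which is why cross-validation is often unnecessary for random forests.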
# List all the other available learning algorithms
tfdf.keras.get_all_models()
>> [tensorflow_decision_forests.keras.RandomForestModel,
>>  tensorflow_decision_forests.keras.GradientBoostedTreesModel,
>>  tensorflow_decision_forests.keras.CartModel]

# Display the hyper-parameters of the Gradient Boosted Trees model
? tfdf.keras.GradientBoostedTreesModel
>> A GBT (Gradient Boosted [Decision] Tree) is a set of shallow decision trees trained sequentially. Each tree is trained to predict and then "correct" for the errors of the previously trained trees (more precisely each tree predicts the gradient of the loss relative to the model output).
>> ...
>> Attributes:
>>   num_trees: Maximum number of decision trees. The effective number of trained trees can be smaller if early stopping is enabled. Default: 300.
>>   max_depth: Maximum depth of the tree. `max_depth=1` means that all trees will be roots. Negative values are ignored. Default: 6.
>> ...

# Create another model with specified hyper-parameters
model = tfdf.keras.GradientBoostedTreesModel(
    num_trees=500,
    growing_strategy="BEST_FIRST_GLOBAL",
    max_depth=8,
    split_axis="SPARSE_OBLIQUE",
)

# Evaluate the model
model.compile(metrics=["accuracy"])
print(model.evaluate(test_ds))
# >> 0.986851
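The docstring's point that "each tree predicts the gradient of the loss relative to the model output" can be made concrete with a toy pure-Python sketch (illustrative only, not TF-DF code): for squared loss the negative gradient is just the residual, and each boosting stage fits a one-split "stump" to those residuals.

```python
def fit_gbt(xs, ys, n_stages=50, lr=0.3):
    """Boost threshold stumps on a tiny 1-D regression problem."""
    pred = [0.0] * len(ys)
    stumps = []
    for _ in range(n_stages):
        # Negative gradient of 0.5*(y - pred)^2 w.r.t. pred is the residual.
        residual = [y - p for y, p in zip(ys, pred)]
        # Fit the best threshold stump to the residuals.
        best = None
        for t in xs:
            left = [r for x, r in zip(xs, residual) if x <= t]
            right = [r for x, r in zip(xs, residual) if x > t]
            lmean = sum(left) / len(left) if left else 0.0
            rmean = sum(right) / len(right) if right else 0.0
            err = sum((r - (lmean if x <= t else rmean)) ** 2
                      for x, r in zip(xs, residual))
            if best is None or err < best[0]:
                best = (err, t, lmean, rmean)
        _, t, lmean, rmean = best
        stumps.append((t, lmean, rmean))
        # Shrink each stump's correction by the learning rate.
        pred = [p + lr * (lmean if x <= t else rmean)
                for x, p in zip(xs, pred)]
    return stumps, pred

xs = [1.0, 2.0, 3.0, 4.0]
ys = [1.0, 1.0, 3.0, 3.0]
_, pred = fit_gbt(xs, ys)
print([round(p, 2) for p in pred])
```

Each stage shrinks the remaining residual by a factor of (1 - lr), which is why the ensemble's predictions converge toward the targets over the stages; real GBT libraries use depth-limited trees instead of stumps and support other losses.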
