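StarCoder: A Code Generation Language Model

Collaborative post · 2023-09-25 23:24

StarCoder (15 billion parameters) is a free large language model released by Hugging Face together with ServiceNow. It is trained primarily to generate code and is intended to compete with AI-based programming tools such as GitHub Copilot and Amazon CodeWhisperer.

Its training data covers more than 80 programming languages as well as text extracted from GitHub.

Installation

First, install all the libraries listed in requirements.txt: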

pip install -r requirements.txt

Code generation

The code generation pipeline looks like this:

from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "bigcode/starcoder"
device = "cuda" # for GPU usage or "cpu" for CPU usage

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
# to save memory, consider loading in fp16 or bf16, e.g. by passing torch_dtype=torch.float16 to from_pretrained (requires `import torch`)
model = AutoModelForCausalLM.from_pretrained(checkpoint).to(device)

inputs = tokenizer.encode("def print_hello_world():", return_tensors="pt").to(device)
outputs = model.generate(inputs)
print(tokenizer.decode(outputs[0]))
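
The comment about fp16/bf16 can be made concrete with some back-of-the-envelope arithmetic. The sketch below estimates the memory needed just to hold the weights, taking the 15 billion parameter figure quoted above and assuming 4 bytes per parameter for float32 and 2 bytes for float16/bfloat16. The helper name `weight_memory_gb` is made up for illustration, and the estimate ignores activations, the KV cache, and framework overhead, so real usage will be higher.

```python
# Rough estimate of weight memory only (assumption: no activations,
# KV cache, or framework overhead are counted).

def weight_memory_gb(num_params: int, bytes_per_param: int) -> float:
    """Return the size of the raw weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9

NUM_PARAMS = 15_000_000_000  # "15 billion parameters", as stated in the article

# Bytes per parameter for each floating-point format.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}

estimates = {
    name: weight_memory_gb(NUM_PARAMS, size)
    for name, size in BYTES_PER_PARAM.items()
}

for name, gb in estimates.items():
    print(f"{name}: ~{gb:.0f} GB for weights alone")
```

Under these assumptions, half precision roughly halves the weight footprint, which is why it is worth considering on a single GPU.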

Alternatively, use the pipeline helper:

from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
checkpoint = "bigcode/starcoder"

model = AutoModelForCausalLM.from_pretrained(checkpoint)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)

pipe = pipeline("text-generation", model=model, tokenizer=tokenizer, device=0)
print(pipe("def hello():"))
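
A text-generation pipeline returns a list of dictionaries, each with a `generated_text` key, and by default that text includes the prompt. The sketch below shows one way to pull out only the newly generated part. The helper `extract_completion` is a hypothetical name, and `sample_output` is a hand-written stand-in with the same shape as a pipeline result, not real model output.

```python
# Minimal sketch: separate the completion from the prompt in a
# text-generation pipeline result. `sample_output` is a hand-written
# stand-in, NOT actual StarCoder output.

def extract_completion(pipeline_output, prompt):
    """Return the generated text with the leading prompt removed, if present."""
    text = pipeline_output[0]["generated_text"]
    if text.startswith(prompt):
        return text[len(prompt):]
    return text

prompt = "def hello():"
sample_output = [{"generated_text": "def hello():\n    print('hello')"}]

completion = extract_completion(sample_output, prompt)
print(repr(completion))
```

With the real pipeline, the same helper would be called as `extract_completion(pipe(prompt), prompt)`.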


