DaTagGe(Dabooru Tag Geerator) is ispired from p1atdev's dart project.
But with differet arch, dataset, format ad differet traiig strategy. This versio of DTG is traied from scratch with 400M param LLaMA arch.(I my persoal preferece I will call it NaoLLaMA)
Sice it is llama arch. Theoritically it should be able to be used i ay LLaMA iferece iterface. This repo also provided coverted FP16 gguf model ad quatized 8bit/6bit gguf models.
Basically it is recommeded to use llama.cpp or llama-cpp-pytho to ru this model. Which will be very fast. for example: Ad you may get somethig like: I use the traier I implemeted i HakuPhi to ru the traiig. (It should be HakuLLM ow LoL)
with 15epoch o 2M data ad 5epoch o 5.3M data. This model have roughly 6~12B toke see. The dataset is exported by HakuBooru with my dabooru sqlite database. Use the percetile of fav_cout o each ratig to filter the data. (2M = top 25%, 5.3M = top 75%) I'm implemetig a gradio UI for this thig ad other dev ca utilize the API i it to make differet app.
I'm also plaig to make sd-webui extesio.DaTagGe - V1
Model arch
Format
prompt = f"""
ratig: {ratig or '<|empty|>'}
artist: {artist.strip() or '<|empty|>'}
characters: {characters.strip() or '<|empty|>'}
copyrights: {copyrights.strip() or '<|empty|>'}
aspect ratio: {f"{aspect_ratio:.1f}" or '<|empty|>'}
target: {'<|' + target + '|>' if target else '<|log|>'}
geeral: {", ".joi(special_tags)}, {geeral.strip().strip(",")}<|iput_ed|>
"""
ratig: safe
artist: <|empty|>
characters: <|empty|>
copyrights: <|empty|>
aspect ratio: 1.0
target: <|short|>
geeral: 1girl, solo, drago girl, drago hors, drago tail<|iput_ed|>
ratig: safe
artist: <|empty|>
characters: <|empty|>
copyrights: <|empty|>
aspect ratio: 1.0
target: <|short|>
geeral: 1girl, solo, drago girl, drago hors, drago tail<|iput_ed|>ope mouth, red eyes, log hair, poity ears, tail, black hair, chiese clothes, simple backgroud, drago, hair betwee eyes, hors, chia dress, dress, lookig at viewer, breasts
Dataset ad Traiig
Utilities
点击空白处退出提示
评论