Data compiled from YouMind's public GitHub and page data

Scrapbook-Style Explainer: How Byte-level BPE Works

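This prompt generates a wide, hand-drawn educational infographic in which a cute mascot character explains how byte-level BPE tokenization works, in a friendly Chinese popular-science style.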

Prompt Text


A cute horizontal educational infographic in a hand-drawn scrapbook style on a pastel desk-and-paper collage background, explaining Chinese word segmentation with byte-level BPE. The scene is divided into 3 clearly separated instructional sections arranged from left to right across a wide banner. On the far left stands a chibi Shiba Inu mascot, {argument name="character name" default="柴小七"}, with warm tan and cream fur, round face, small triangular ears, rosy cheeks, and a curious expression, holding a cup and standing beside a small desk with drawers, pencils, and a chair. Above the dog is a bold rounded white title box with black Chinese text: {argument name="headline text" default="中文分词:Byte-level BPE(BBPE)流程科普"}. In the first teaching section near the upper middle, show 4 small translucent blue token-shaped tiles lined up on a wooden shelf, each labeled “Token”, with a curved arrow and small handwritten Chinese note “词频语料统计” pointing toward the next step. In the second section, place a large magnifying glass highlighting 3 small frequency tiles labeled exactly “E7”, “94”, and “B5”, with the section label in a yellow note reading “2. 频率统计与合并”; beneath and inside the magnified area include a large black Chinese character “电”, and nearby a handwritten note “频繁字节对”. In the third section at lower middle-left, add a wooden sign and a larger merged translucent blue token tile labeled “Token”, with a small yellow note reading “3. 跨字合并”, large black Chinese text “我们→”, and a caption strip below that says “高频词组合并为 Token”. On the far right, show the final explanatory result with 3 small byte boxes labeled “E7”, “94”, and “B5” above a large black Chinese character “电”, then a note card reading “1. 字节级编码(UTF-8)”, and below that the large black Chinese word “我们”. Connect the sections with curved arrows in pink, blue, and green to show process flow. Include 1 animated blue token mascot with tiny arms and legs near the center-bottom, smiling and waving. 
Use soft cream, pink, beige, and light blue colors, thick clean outlines, sticker-like cutout shapes, taped paper corners, notebook textures, pencils around the edges, and a friendly journal-style (手账) illustration look suitable for a science explainer graphic.
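
For readers who want to check the byte labels the prompt asks for, the three steps named in the infographic (UTF-8 byte encoding, pair frequency counting and merging, and merging across characters) can be sketched in a few lines of Python. This is a toy illustration rather than any particular tokenizer's implementation: the helper names `utf8_hex`, `count_pairs`, `merge_pair`, and `train_toy_bbpe` are hypothetical, ties between equally frequent pairs are broken by first occurrence, and the tiny corpus is invented for the demonstration.

```python
from collections import Counter


def utf8_hex(text):
    """Return the UTF-8 bytes of text as uppercase hex strings."""
    return [f"{b:02X}" for b in text.encode("utf-8")]


def count_pairs(sequences):
    """Count adjacent token pairs across all token sequences."""
    pairs = Counter()
    for seq in sequences:
        pairs.update(zip(seq, seq[1:]))
    return pairs


def merge_pair(seq, pair):
    """Replace each non-overlapping occurrence of pair with one merged token."""
    merged, i = [], 0
    while i < len(seq):
        if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
            merged.append(seq[i] + seq[i + 1])
            i += 2
        else:
            merged.append(seq[i])
            i += 1
    return merged


def train_toy_bbpe(corpus, num_merges):
    """Toy byte-level BPE: start from single bytes, merge the most frequent pair each round."""
    seqs = [[bytes([b]) for b in text.encode("utf-8")] for text in corpus]
    merges = []
    for _ in range(num_merges):
        pairs = count_pairs(seqs)
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # assumption: ties go to the first pair seen
        merges.append(best)
        seqs = [merge_pair(seq, best) for seq in seqs]
    return merges, seqs


# Step 1: byte-level encoding. "电" is three UTF-8 bytes.
print(utf8_hex("电"))  # ['E7', '94', 'B5']

# Steps 2 and 3: repeated merges eventually join bytes across the two characters of "我们".
merges, seqs = train_toy_bbpe(["我们"] * 3, num_merges=5)
print(seqs[0][0].decode("utf-8"))
```

Because "我们" occupies six bytes, five merges are enough to collapse it into a single token in this toy corpus; real vocabularies are trained on far larger text, and the order of their merges will differ.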