Prompt Gallery

Infographic / Educational Diagram / Chart

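Scrapbook-Style Explainer: How Byte-level BPE Works

This prompt generates a wide, hand-drawn educational infographic in which cute mascot characters explain byte-level BPE tokenization in a friendly, Chinese-language popular-science style.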

ID
15292
Author
程序员Left
Tags
Infographic / Educational Diagram / Chart / Character / IP / Sticker / VTuber / Commercial Poster / Advertising / Social Media

Chinese prompt

一张可爱的横向教育信息图,采用手绘剪贴簿风格,背景为柔和的桌面和纸张拼贴,旨在解释基于 byte-level BPE 的中文分词原理。画面被分为 3 个清晰的教学区域,从左至右横跨整个宽幅横幅。最左侧站着一只可爱的柴犬吉祥物,{argument name="character name" default="柴小七"},拥有温暖的棕褐色和奶油色毛发、圆脸、小三角耳、红润的脸颊和好奇的表情,手里拿着一个杯子,站在一张带有抽屉、铅笔和椅子的书桌旁。在柴犬上方是一个粗体圆角的白色标题框,内含黑色中文字符:{argument name="headline text" default="中文分词:Byte-level BPE(BBPE)流程科普"}。在靠近上方的第一个教学区域,展示 4 个排列在木架上的半透明蓝色 Token 形状方块,每个方块上标注“Token”,并配有一个弯曲的箭头和手写中文注释“词频语料统计”,指向下一步。在第二个区域,放置一个巨大的放大镜,突出显示 3 个标有“E7”、“94”和“B5”的频率方块,区域标签写在黄色便签上,内容为“2. 频率统计与合并”;在放大区域下方和内部包含一个巨大的黑色汉字“电”,旁边附有手写注释“频繁字节对”。在第三个区域的中下方,添加一个木制标牌和一个更大的合并后的半透明蓝色 Token 方块,上面标注“Token”,并附有一张黄色便签,写着“3. 跨字合并”,大号黑色中文字符“我们→”,下方配有说明条“高频词组合并为 Token”。在最右侧,展示最终的解释结果,3 个标有“E7”、“94”和“B5”的小字节方块位于一个巨大的黑色汉字“电”上方,旁边是一张写有“1. 字节级编码(UTF-8)”的便签卡,下方是巨大的黑色中文词汇“我们”。用粉色、蓝色和绿色的弯曲箭头连接各个区域,以展示流程走向。在底部中央附近包含一个带有细小四肢、面带微笑并挥手的蓝色 Token 吉祥物。使用柔和的奶油色、粉色、米色和浅蓝色,线条厚实清晰,采用贴纸般的剪裁形状、胶带纸角、笔记本纹理、边缘散落的铅笔,以及适合科普图表的友好手账插画风格。
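
The prompt contains two `{argument name="..." default="..."}` placeholders. How the gallery expands them is not documented here, so the following is only a minimal sketch, assuming each placeholder is replaced by a user-supplied value and falls back to its `default` otherwise. The names `ARG_PATTERN` and `fill_arguments` are hypothetical helpers, not part of any gallery API.

```python
import re

# Assumed placeholder syntax, copied from the prompt text:
#   {argument name="character name" default="柴小七"}
ARG_PATTERN = re.compile(r'\{argument name="([^"]*)" default="([^"]*)"\}')


def fill_arguments(prompt, values=None):
    """Replace each placeholder with values[name], or with its default."""
    values = values or {}

    def substitute(match):
        name, default = match.group(1), match.group(2)
        return values.get(name, default)

    return ARG_PATTERN.sub(substitute, prompt)


snippet = 'mascot, {argument name="character name" default="柴小七"}, with warm tan fur'
print(fill_arguments(snippet))                                # uses the default name
print(fill_arguments(snippet, {"character name": "Mochi"}))   # overrides it
```

Leaving `values` empty reproduces the prompt with the author's defaults, which is the safest starting point because the other labels in the prompt are written to match the default headline.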

Original prompt

A cute horizontal educational infographic in a hand-drawn scrapbook style on a pastel desk-and-paper collage background, explaining Chinese word segmentation with byte-level BPE. The scene is divided into 3 clearly separated instructional sections arranged from left to right across a wide banner. On the far left stands a chibi Shiba Inu mascot, {argument name="character name" default="柴小七"}, with warm tan and cream fur, round face, small triangular ears, rosy cheeks, and a curious expression, holding a cup and standing beside a small desk with drawers, pencils, and a chair. Above the dog is a bold rounded white title box with black Chinese text: {argument name="headline text" default="中文分词:Byte-level BPE(BBPE)流程科普"}. In the first teaching section near the upper middle, show 4 small translucent blue token-shaped tiles lined up on a wooden shelf, each labeled “Token”, with a curved arrow and small handwritten Chinese note “词频语料统计” pointing toward the next step. In the second section, place a large magnifying glass highlighting 3 small frequency tiles labeled exactly “E7”, “94”, and “B5”, with the section label in a yellow note reading “2. 频率统计与合并”; beneath and inside the magnified area include a large black Chinese character “电”, and nearby a handwritten note “频繁字节对”. In the third section at lower middle-left, add a wooden sign and a larger merged translucent blue token tile labeled “Token”, with a small yellow note reading “3. 跨字合并”, large black Chinese text “我们→”, and a caption strip below that says “高频词组合并为 Token”. On the far right, show the final explanatory result with 3 small byte boxes labeled “E7”, “94”, and “B5” above a large black Chinese character “电”, then a note card reading “1. 字节级编码(UTF-8)”, and below that the large black Chinese word “我们”. Connect the sections with curved arrows in pink, blue, and green to show process flow. Include 1 animated blue token mascot with tiny arms and legs near the center-bottom, smiling and waving. 
Use soft cream, pink, beige, and light blue colors, thick clean outlines, sticker-like cutout shapes, taped paper corners, notebook textures, pencils around the edges, and a friendly journal-style (手账) illustration look suitable for a science explainer graphic.
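
The byte labels and the three steps named in the prompt (UTF-8 byte encoding, frequency counting and merging, cross-character merging) can be checked with a small sketch. This is a toy illustration under stated assumptions, not a production tokenizer: the corpus is made up, there is no pre-tokenization, and the helper names `utf8_hex`, `most_frequent_pair`, `merge_pair`, and `train_bpe` are hypothetical.

```python
from collections import Counter


def utf8_hex(text):
    """Return the UTF-8 bytes of text as uppercase hex strings."""
    return [f"{b:02X}" for b in text.encode("utf-8")]


def most_frequent_pair(sequences):
    """Count adjacent token pairs over all sequences; return the most common."""
    counts = Counter()
    for seq in sequences:
        counts.update(zip(seq, seq[1:]))
    return counts.most_common(1)[0][0] if counts else None


def merge_pair(seq, pair):
    """Replace each non-overlapping occurrence of pair with one merged token."""
    merged, i = [], 0
    while i < len(seq):
        if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
            merged.append(seq[i] + seq[i + 1])  # bytes concatenation
            i += 2
        else:
            merged.append(seq[i])
            i += 1
    return merged


def train_bpe(words, num_merges):
    """Start from single bytes and apply up to num_merges greedy merges."""
    sequences = [[bytes([b]) for b in w.encode("utf-8")] for w in words]
    merges = []
    for _ in range(num_merges):
        pair = most_frequent_pair(sequences)
        if pair is None:
            break
        merges.append(pair)
        sequences = [merge_pair(seq, pair) for seq in sequences]
    return merges, sequences


# Step 1: byte-level encoding (UTF-8).
print(utf8_hex("电"))  # ['E7', '94', 'B5']

# Step 2: frequent byte pairs inside one character get merged.
_, seqs = train_bpe(["电", "电", "电", "我们"], num_merges=2)
print(seqs[0])

# Step 3: with enough merges, a frequent two-character word becomes one token.
_, seqs = train_bpe(["我们"] * 3, num_merges=5)
print(seqs[0])
```

In this toy corpus, two merges are enough to collapse the three bytes of "电" into a single token, while the six bytes of "我们" need five merges, which is why the infographic shows cross-character merging as a later step.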