概述
该数据集包括超过 40 万个手写姓名,它们都是通过慈善项目收集的,旨在支持世界各地的弱势儿童。
光学字符识别 (OCR) 利用图像处理技术将扫描文档上的字符转换为数字形式。通常而言,这种技术识别机器打印字体的质量非常高。不过,个人书写风格的巨大差异给通过机器识别手写字符带来了诸多艰巨挑战。
该数据集总共包括 206,799 个姓氏和 207,024 个名字,分为训练集(331,059 个)、测试集(41,382 个)和验证集(41,382 个)。
此外,它还提供了通过澳鹏平台上的人工辅助标注功能创建的所有图像标签,使您可以使用自己的数据扩展该数据集。
该任务的输入数据是成千上万个手写姓名的图像。点击上方的“数据 (Data)”标签,您会发现转录后的图像分为测试集、训练集和验证集。
Image | url | |||||
---|---|---|---|---|---|---|
D2M | 15 | 0010079F | 0002 | 1 | first | name.jpg |
D2M | 15 | 0010079F | 0002 | 1 | surname.jpg | |
D2M | 15 | 0010079F | 0003 | 2 | surname.jpg | |
D2M | 15 | 0010079F | 0004 | 3 | first | name.jpg |
D2M | 15 | 0010079F | 0004 | 3 | surname.jpg | |
D2M | 15 | 0010079F | 0005 | 4 | first | name.jpg |
D2M | 15 | 0010079F | 0006 | 5 | first | name.jpg |
D2M | 15 | 0010079F | 0006 | 5 | surname.jpg | |
D2M | 15 | 0010079F | 0007 | 6 | first | name.jpg |
数据

written_name_test_v2.csv
Filename | Identity |
---|---|
D2M_15_0010079F_0002_1_first_name.jpg | HIMELIN |
D2M_15_0010079F_0002_1_surname.jpg | CAYRE |
D2M_15_0010079F_0003_2_surname.jpg | DURAND |
D2M_15_0010079F_0004_3_first_name.jpg | YOANN |
D2M_15_0010079F_0004_3_surname.jpg | VENET |
D2M_15_0010079F_0005_4_first_name.jpg | MARTIN |
D2M_15_0010079F_0006_5_first_name.jpg | CLEMENT |
D2M_15_0010079F_0006_5_surname.jpg | LEANDRE |
D2M_15_0010079F_0007_6_first_name.jpg | PRISCILLE |
D2M_15_0010079F_0008_7_first_name.jpg | VALENTIN |
D2M_15_0010079F_0008_7_surname.jpg | TISSANDIER |
D2M_15_0010079F_0009_8_first_name.jpg | LEO |
D2M_15_0010079F_0009_8_surname.jpg | PEREZ |
D2M_15_0010079F_0010_9_first_name.jpg | LAURINE |
D2M_15_0010079F_0010_9_surname.jpg | PARISOT |
D2M_15_0010079F_0011_10_first_name.jpg | TATIANA-EVA |
D2M_15_0010079F_0011_10_surname.jpg | ZIEGELBAUNBORSKI |
D2M_15_0010079F_0012_11_first_name.jpg | JUITETTE |
D2M_15_0010079F_0013_12_first_name.jpg | NOAH |
D2M_15_0010079F_0014_13_surname.jpg | PAVLOVIC |
D2M_15_0010079F_0015_14_surname.jpg | PONTHUS |
D2M_15_0010079F_0016_15_first_name.jpg | ALICE |
D2M_15_0010079F_0016_15_surname.jpg | GUEGAN |
D2M_15_0010079F_0017_16_first_name.jpg | ESTELLE |
D2M_15_0010079F_0017_16_surname.jpg | GIRERD-MARTIN |
You are previewing the first 25 rows of this dataset. Download this file for the full dataset.
Column Name | Column Description |
---|---|
FILENAME | URL of handwritten image |
IDENTITY | Transcribed handwritten name |

written_name_train_v2.csv
FILENAME | IDENTITY |
---|---|
D2M_15_0010079F_0002_1_first_name.jpg | HIMELIN |
D2M_15_0010079F_0002_1_surname.jpg | CAYRE |
D2M_15_0010079F_0003_2_surname.jpg | DURAND |
D2M_15_0010079F_0004_3_first_name.jpg | YOANN |
D2M_15_0010079F_0004_3_surname.jpg | VENET |
D2M_15_0010079F_0005_4_first_name.jpg | MARTIN |
D2M_15_0010079F_0006_5_first_name.jpg | CLEMENT |
D2M_15_0010079F_0006_5_surname.jpg | LEANDRE |
D2M_15_0010079F_0007_6_first_name.jpg | PRISCILLE |
D2M_15_0010079F_0008_7_first_name.jpg | VALENTIN |
D2M_15_0010079F_0008_7_surname.jpg | TISSANDIER |
D2M_15_0010079F_0009_8_first_name.jpg | LEO |
D2M_15_0010079F_0009_8_surname.jpg | PEREZ |
D2M_15_0010079F_0010_9_first_name.jpg | LAURINE |
D2M_15_0010079F_0010_9_surname.jpg | PARISOT |
D2M_15_0010079F_0011_10_first_name.jpg | TATIANA-EVA |
D2M_15_0010079F_0011_10_surname.jpg | ZIEGELBAUNBORSKI |
D2M_15_0010079F_0012_11_first_name.jpg | JUITETTE |
D2M_15_0010079F_0013_12_first_name.jpg | NOAH |
D2M_15_0010079F_0014_13_surname.jpg | PAVLOVIC |
D2M_15_0010079F_0015_14_surname.jpg | PONTHUS |
D2M_15_0010079F_0016_15_first_name.jpg | ALICE |
D2M_15_0010079F_0016_15_surname.jpg | GUEGAN |
D2M_15_0010079F_0017_16_first_name.jpg | ESTELLE |
D2M_15_0010079F_0017_16_surname.jpg | GIRERD-MARTIN |
You are previewing the first 25 rows of this dataset. Download this file for the full dataset.
Column Name | Column Description |
---|---|
FILENAME | URL of handwritten image |
IDENTITY | Transcribed handwritten name |

written_name_validation_v2.csv
FILENAME | IDENTITY |
---|---|
D2M_15_0010079F_0003_2_first_name.jpg | ROBIN |
D2M_15_0010079F_0005_4_surname.jpg | CHAMBARO |
D2M_15_0010079F_0007_6_surname.jpg | PERROT |
D2M_15_0010079F_0025_24_first_name.jpg | LAURENCIANE |
D2M_15_0010079F_0025_24_surname.jpg | CAYRE |
D2M_15_0010079F_0031_30_surname.jpg | FEBNANDES |
D2M_15_0010079F_0037_36_first_name.jpg | ELVIS |
D2M_15_0010079F_0039_38_first_name.jpg | MAXENCE |
D2M_15_0010079F_0042_41_surname.jpg | VILLARD |
D2M_15_0010079F_0063_62_surname.jpg | B |
D2M_15_0010079F_0074_73_first_name.jpg | COGNAC |
D2M_15_0010079F_0081_80_first_name.jpg | THEN |
D2M_15_0010079F_0104_103_first_name.jpg | NATHAN |
D2M_15_0010079F_0108_107_first_name.jpg | NAEL |
D2M_15_0010088R_0010_119_first_name.jpg | JEROME |
D2M_15_0010088R_0014_123_first_name.jpg | NINA |
D2M_15_0010088R_0023_132_surname.jpg | MARCHAND |
D2M_15_0010088R_0026_135_surname.jpg | QUARROZ |
D2M_15_0010088R_0043_152_surname.jpg | DEGODET |
D2M_15_0010088R_0045_154_first_name.jpg | TIZIA |
D2M_15_0010088R_0046_155_surname.jpg | BACHMANN |
D2M_15_0010088R_0051_160_surname.jpg | SHARILLOU |
D2M_15_0010088R_0053_162_surname.jpg | FARHAT |
D2M_15_0010088R_0054_163_first_name.jpg | GABY |
D2M_15_0010088R_0063_172_surname.jpg | MADIER |
You are previewing the first 25 rows of this dataset. Download this file for the full dataset.
Column Name | Column Description |
---|---|
FILENAME | URL of handwritten image |
IDENTITY | Transcribed handwritten name |