Attention: you must agree to the MSR-LA before downloading.

Aligned face images:

Purpose: The faces are aligned by MSR’s algorithm so that participants who don’t have face detection and alignment modules at hand can train models directly. We will use the same alignment approach on the DevSet and MeasurementSet.

  • Download link
  • Some statistics:
    • # of Entities: 99,892
    • # of Lines: 8,456,240
    • Image Resolution: up to 300*300
    • Average Image # per Entity: 85
    • Total file size (uncompressed): 89 GB
  • File format: text files, each line is an image record containing 7 columns, delimited by TAB.
    • Column1: Freebase MID
    • Column2: ImageSearchRank
    • Column3: ImageURL
    • Column4: PageURL
    • Column5: FaceID
    • Column6: FaceRectangle_Base64Encoded (four floats, relative coordinates of UpperLeft and BottomRight corner)
    • Column7: FaceData_Base64Encoded
  • [...] Data: 14 entities, 32MB
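
As a rough illustration of the record layout described above, the following Python sketch splits one TAB-delimited line into its seven columns and decodes the two Base64 fields. Several details are assumptions rather than part of the format description: the four rectangle floats are taken to be little-endian 32-bit values, the decoded face data is kept as raw bytes (presumably an encoded image), and the names `FaceRecord` and `parse_record` are hypothetical.

```python
import base64
import struct
from typing import NamedTuple, Tuple


class FaceRecord(NamedTuple):
    """One image record (hypothetical container; field names mirror the columns)."""
    mid: str                                      # Column1: Freebase MID
    search_rank: str                              # Column2: ImageSearchRank (kept as text)
    image_url: str                                # Column3: ImageURL
    page_url: str                                 # Column4: PageURL
    face_id: str                                  # Column5: FaceID
    face_rect: Tuple[float, float, float, float]  # Column6, decoded
    face_data: bytes                              # Column7, decoded


def parse_record(line: str) -> FaceRecord:
    """Parse a single TAB-delimited line into a FaceRecord."""
    cols = line.rstrip("\r\n").split("\t")
    if len(cols) != 7:
        raise ValueError(f"expected 7 columns, got {len(cols)}")
    mid, rank, image_url, page_url, face_id, rect_b64, data_b64 = cols

    # ASSUMPTION: the rectangle is four little-endian float32 values (16 bytes),
    # holding the relative coordinates of the upper-left and bottom-right corners.
    rect_bytes = base64.b64decode(rect_b64)
    if len(rect_bytes) != 16:
        raise ValueError(f"expected 16 rectangle bytes, got {len(rect_bytes)}")
    face_rect = struct.unpack("<4f", rect_bytes)

    # ASSUMPTION: the face data is left as raw bytes; interpreting it as an
    # image is up to the caller.
    face_data = base64.b64decode(data_b64)

    return FaceRecord(mid, rank, image_url, page_url, face_id, face_rect, face_data)
```

Because the uncompressed file is large, it is probably best to read it line by line (for example, `for line in open(path, encoding="utf-8")`) and call `parse_record` on each line rather than loading the whole file into memory.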


  1. The data is released for non-commercial research purposes only. You must read and agree to the MSR Data License Agreement before downloading the data;
  2. Please contact us if you are a celebrity and do not want to be included in this data set. We will remove the related entries upon request;
  3. In all the related publications, please cite the paper "MS-Celeb-1M: A Dataset and Benchmark for Large Scale Face Recognition" and provide the link to
    @INPROCEEDINGS { guo2016msceleb,
        author = {Guo, Yandong and Zhang, Lei and Hu, Yuxiao and He, Xiaodong and Gao, Jianfeng},
        title = {M{S}-{C}eleb-1{M}: A Dataset and Benchmark for Large Scale Face Recognition},
        booktitle = {European Conference on Computer Vision},
        year = {2016},