首页
CtrlK

10-GPT4

10-GPT4

GPT4

一无所知

https://openai.com/research/gpt-4

对于GPT4的技术细节基本一无所知

2 Scope and Limitations of this Technical Report
This report focuses on the capabilities, limitations, and safety properties of GPT-4. GPT-4 is a Transformer-style model [39] pre-trained to predict the next token in a document, using both publicly available data (such as internet data) and data licensed from third-party providers. The model was then fine-tuned using Reinforcement Learning from Human Feedback (RLHF) [40]. Given both the competitive landscape and the safety implications of large-scale models like GPT-4, this report contains no further details about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.

看得见

GPT4 能看图了

辟谣 不能读图片链接

![image.png](https://cdn.modevol.com/user/ckxe4nrot01l501s5g2ehd6ge/images/o1fgqdle2u0b7jd9mu9ly7ng.png)image.png![](assets/10-GPT4/image-20240223172511387.png)

能力更强

在很多测试中,GPT-4 的分数都比 GPT3.5 高

更多语言

GPT-4 会更多的语言

闽南语

![image.png](https://cdn.modevol.com/user/ckxe4nrot01l501s5g2ehd6ge/images/lxzmkxze1jnis6ccwaa7fof3.png)image.png

不是一知半解

在之前课程中讨论了“模型越大效果越差的任务”,在GPT-4这得到了反转,GPT-4能正确应对这些任务。

之前课程指:【精准空降到 14:50】 https://www.bilibili.com/video/BV1TD4y137mP/?p=29&share_source=copy_web&vd_source=04259c9260832797bf08914e26e438d5&t=890

知道自己不知道

GPT-4 在 pre-train 时知道自己不知道,但跟人做互动、从人类老师学习后反而失去 Calibration 的能力

![image.png](https://cdn.modevol.com/user/ckxe4nrot01l501s5g2ehd6ge/images/ivbg9mk35mu1sbx6qohfmipp.png)image.png

猜测 输入影像方式

可能的方式:

  1. 将图像转成文字

    • Caption Generation

    • 光学字元辨识 (OCR)

  2. 将图像转成向量:

    • Image Encoder

      • CLIP 离散化,然后每一个离散后的code用一个符号来表示,就把影像转成全新的语言

      • 仿照Kosmos

10-GPT4
Mleon的头像
创建于:2024-02-23
随记
讨论
媒体