Get More Data

本文探讨了为何在低偏差学习模型中需要更多数据以改善模型表现,并介绍了如何判断何时需要增加数据量的方法。此外,还提供了多种获取额外数据的途径,如人工合成数据、手动收集标注等。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

Get More Data

1. Why We Need More Data?

In many situations (low bias learning model), more data usually means better performance of the model.

2. When We Need More Data?

Usually, we should plot the learning curve by using part of the training data (1/10). If we have low bias curve, then we are safely increase the training data to get better machine learning model.

3. How to Get More Data?

  • Artificial data synthesis (e.g., rotation, crop, change background, etc)
  • Collect and label the data manually
  • Hire other company to label (e.g., Amazon Mechanical Turk)

Usually to make the original data 10 times larger won’t take so much effort, but it will make the performance of the model much better.

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值