10th Indian Delegation to Dubai, Gitex & Expand North Star – World’s Largest Startup Investor Connect
Artificial Intelligence

OpenAI proposes a new way to use GPT-4 for content moderation, easing human workload

OpenAI has introduced a method to leverage its advanced AI model, GPT-4, for content moderation, aiming to lessen the workload on human moderation teams. The approach, outlined in a recent OpenAI blog post, involves prompting GPT-4 with a specific policy guiding its moderation decisions. This includes creating a test dataset of content examples that may or may not violate the policy.

OpenAI Policy Refinement Through Human Labeling and Model Feedback

To refine this process, policy experts label the content examples and then present them, devoid of labels, to GPT-4. The model’s determinations are compared to those of humans, allowing policy experts to analyze discrepancies, seek reasoning behind GPT-4’s judgments, and clarify ambiguities within the policy. This iterative approach aims to enhance the quality of the moderation policy.

OpenAI Promised Reduction in Policy Rollout Time

OpenAI claims that its technique, already adopted by several customers, has the potential to significantly reduce the time required to implement new content moderation policies. The process could be streamlined to just a matter of hours. OpenAI contends that its method surpasses alternative approaches, including those proposed by startups like Anthropic, which OpenAI criticizes for their rigidity in relying on models’ “internalized judgments.”

Challenges and Biases in AI-Powered Moderation

While AI-driven moderation tools have gained traction, challenges persist. Biases within training datasets, introduced by human annotators, can impact the effectiveness of such tools. OpenAI acknowledges these challenges, noting that AI-generated judgments are susceptible to undesired biases from the training process. Ongoing human validation and refinement remain crucial to mitigate these biases.

GPT-4’s Potential and Caution in Moderation

Although GPT-4’s predictive capabilities hold promise for improved moderation, OpenAI acknowledges the need for careful monitoring and validation due to inherent biases and potential errors. OpenAI’s initiative to harness GPT-4 for content moderation demonstrates a step toward automating moderation tasks, yet it is essential to remember that AI, even at its best, can still make errors. Maintaining human oversight remains vital to ensure responsible and unbiased content moderation.

Also Read The Latest News:
Ola Electric unveils four new electric bikes following S1 X scooter launch
X, formerly Twitter, faces scrutiny for slowing down access to disliked websites

by 9to5mac

Apple is reportedly delaying the launch of the iPhone Air 2. The Information reports that Apple recently “notified engineers and suppliers that they were taking the next iPhone Air off the schedule without providing a new release date.” The report cites “three people involved in the project.” iPhone Air 2 release delayed The second-generation iPhone Air was initially set to launch next fall alongside the iPhone 18 Pro and iPhone Fold. According to The Information, the… Source link

by CNBCTV

The 245th Report of the Parliamentary Standing Committee calls for a review of the IT Act 2000 since many of the serious offences are bailable; it has recommended amending the Act to make the offences severely punishable and to make intermediaries responsible for compensating victims, notes former Central Board of Indirect Tax & Customs chairman Najib Shah. Source link

by 9to5mac

Listen to a recap of the top stories of the day from 9to5Mac. 9to5Mac Daily is available on iTunes and Apple’s Podcasts app, Stitcher, TuneIn, Google Play, or through our dedicated RSS feed for Overcast and other podcast players. Sponsored by Backblaze: Never lose a file again. Use code “9to5daily” at checkout for 20% off or try for free.  New episodes of 9to5Mac Daily are recorded every weekday. Subscribe to our podcast in Apple Podcast or your favorite podcast… Source link