[ATTW-L] AI code of conduct

Dragga, Sam Sam.Dragga at ttu.edu
Fri Jul 21 18:22:46 UTC 2023


For a timely example of a code of conduct, consider the Voluntary AI Commitments issued this morning at the White House by seven corporations engaged in AI technologies (Amazon, Anthropic, Google, Inflection, Meta, Microsoft, and OpenAI):

Safety
1) Commit to internal and external red-teaming of models or systems in areas including misuse, societal risks, and national security concerns, such as bio, cyber, and other safety areas
2) Work toward information sharing among companies and governments regarding trust and safety risks, dangerous or emergent capabilities, and attempts to circumvent safeguards

Security
3) Invest in cybersecurity and insider threat safeguards to protect proprietary and unreleased model weights
4) Incent third-party discovery and reporting of issues and vulnerabilities

Trust
5) Develop and deploy mechanisms that enable users to understand if audio or visual content is AI-generated, including robust provenance, watermarking, or both, for AI-generated audio or visual content
6) Publicly report model or system capabilities, limitations, and domains of appropriate and inappropriate use, including discussion of societal risks, such as effects on fairness and bias
7) Prioritize research on societal risks posed by AI systems, including on avoiding harmful bias and discrimination, and protecting privacy
8) Develop and deploy frontier AI systems to help address society’s greatest challenges

The four-page agreement, with a brief explanation of the three principles and eight actions, is available at https://www.whitehouse.gov/wp-content/uploads/2023/07/Ensuring-Safe-Secure-and-Trustworthy-AI.pdf. It raises important questions for class discussion of the powers and limits of a code of conduct.


Sam

Sam Dragga
Professor Emeritus
Texas Tech University
sam.dragga at ttu.edu
1-806-543-6099
