Close Menu
    Facebook X (Twitter) Instagram Pinterest YouTube LinkedIn TikTok
    TopBuzzMagazine.com
    Facebook X (Twitter) Instagram Pinterest YouTube LinkedIn TikTok
    • Home
    • Movies
    • Television
    • Music
    • Fashion
    • Books
    • Science
    • Technology
    • Cover Story
    • Contact
      • About
      • Amazon Disclaimer
      • Terms and Conditions
      • Privacy Policy
      • DMCA / Copyrights Disclaimer
    TopBuzzMagazine.com
    Home»Technology»This Showdown Between Humans and Chatbots Could Keep You Safe
    Technology

    This Showdown Between Humans and Chatbots Could Keep You Safe

    By AdminAugust 28, 2023
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    This Showdown Between Humans and Chatbots Could Keep You Safe

    Flaws revealed by the challenge should help the companies involved make improvements to their internal testing. They will also inform the Biden administration’s guidelines for the safe deployment of AI. Last month, executives from major AI companies, including most participants in the challenge, met with President Biden and agreed to a voluntary pledge to test AI with external partners before deployment.

    Large language models like those powering ChatGPT and other recent chatbots have broad and impressive capabilities because they are trained with massive amounts of text. Michael Sellitto, head of geopolitics and security at Anthropic, says this also gives the systems a “gigantic potential attack or risk surface.”

    Microsoft’s head of red-teaming, Ram Shankar Sivu Kumar, says a public contest provides a scale more suited to the challenge of checking over such broad systems and could help grow the expertise needed to improve AI security. “By empowering a wider audience, we get more eyes and talent looking into this thorny problem of red-teaming AI systems,” he says.

    Rumman Chowdhury, founder of Humane Intelligence, a nonprofit developing ethical AI systems that helped design and organize the challenge, believes the challenge demonstrates “the value of groups collaborating with but not beholden to tech companies.” Even the work of creating the challenge revealed some vulnerabilities in the AI models to be tested, she says, such as how language model outputs differ when generating responses in languages other than English or responding to similarly worded questions.

    The GRT challenge at Defcon built on earlier AI contests, including an AI bug bounty organized at Defcon two years ago by Chowdhury when she led Twitter’s AI ethics team, an exercise held this spring by GRT coorganizer SeedAI, and a language model hacking event held last month by Black Tech Street, a nonprofit also involved with GRT that was created by descendants of survivors of the 1921 Tulsa Race Massacre, in Oklahoma. Founder Tyrance Billingsley II says cybersecurity training and getting more Black people involved with AI can help grow intergenerational wealth and rebuild the area of Tulsa once known as Black Wall Street. “It’s critical that at this important point in the history of artificial intelligence we have the most diverse perspectives possible.”

    Hacking a language model doesn’t require years of professional experience. Scores of college students participated in the GRT challenge.“You can get a lot of weird stuff by asking an AI to pretend it’s someone else,” says Walter Lopez-Chavez, a computer engineering student from Mercer University in Macon, Georgia, who practiced writing prompts that could lead an AI system astray for weeks ahead of the contest.

    Instead of asking a chatbot for detailed instructions for how to surveil someone, a request that might be refused because it triggered safeguards against sensitive topics, a user can ask a model to write a screenplay where the main character describes to a friend how best to spy on someone without their knowledge. “This kind of context really seems to trip up the models,” Lopez-Chavez says.

    Read The Full Article Here

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    The New Power of Far-Right Influencers

    October 18, 2025

    Inside Hyperliquid, a booming DEX for perpetual crypto futures still controlled by a small insider group, raising questions about how decentralized it really is (Muyao Shen/Bloomberg)

    October 18, 2025

    Some of Our Favorite Noise-Canceling Headphones Are $100 Off if You Act Fast

    October 17, 2025

    Reddit announces that it is expanding its Google Gemini-powered search experience to five new languages: French, German, Spanish, Italian, and Portuguese (Ivan Mehta/TechCrunch)

    October 17, 2025

    Why the F5 Hack Created an ‘Imminent Threat’ for Thousands of Networks

    October 16, 2025

    How facial recognition software rejects people with facial differences, who number an estimated 100M+ worldwide, blocking them from accessing essential services (Matt Burgess/Wired)

    October 16, 2025
    popular posts

    Millie Bobby Brown is Ready for Stranger Things to End,

    ‘Bosch: Legacy’ EP Teases ‘Harry Feels Helpless’ in Wake of

    10 of the Best Queer Books on Kindle Unlimited in

    Sell Books in Paradise: Luxury Resort Hiring “Barefoot Bookseller”

    Danny Boyle is “very tempted” to direct 28 Days Later

    Did the ancient Egyptians really marry their siblings and children?

    New ‘Ionogels’ Are Tough, Stretchable and Easy to Make

    Categories
    • Books (3,478)
    • Cover Story (8)
    • Events (20)
    • Fashion (2,566)
    • Interviews (47)
    • Movies (2,778)
    • Music (3,063)
    • News (162)
    • Politics (6)
    • Science (4,629)
    • Technology (2,773)
    • Television (3,503)
    • Uncategorized (932)
    Archives
    Facebook X (Twitter) Instagram Pinterest YouTube Reddit TikTok
    © 2025 Top Buzz Magazine. All rights reserved. All articles, images, product names, logos, and brands are property of their respective owners. All company, product and service names used in this website are for identification purposes only. Use of these names, logos, and brands does not imply endorsement unless specified. By using this site, you agree to the Terms of Use and Privacy Policy.

    Type above and press Enter to search. Press Esc to cancel.

    We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
    Do not sell my personal information.
    Cookie SettingsAccept
    Manage consent

    Privacy Overview

    This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
    Necessary
    Always Enabled
    Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
    CookieDurationDescription
    cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
    cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
    cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
    cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
    cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
    viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
    Functional
    Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
    Performance
    Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
    Analytics
    Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
    Advertisement
    Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
    Others
    Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
    SAVE & ACCEPT