Gen AI Security:

Gen AI Security:

  • Unless you have been living under a rock, the last 6-9 months have been a second coming of AI. The previous wave saw the success of a few techniques like anomaly/outlier detection, clustering, advanced fraud/risk/financial models, computer vision (autonomous training, robotics, image recognition, etc.), NLP (natural language processing). The current wave is generative AI which generates text, audio, and video from prompts and was ushered in by OpenAI releasing a very powerful chatbot ChatGPT reaching 100 million users in no time
  • This week Sam Altman, CEO of OpenAI testified against Congress lobbying for regulation against AI citing concerns about misusing the technology and triggering debate about AI Safety. AI Safety has two angles
  1. Leveraging AI for harmful purposes: This includes impersonation, phishing, disinformation
  2. Tampering AI used for beneficial purposes: This includes hacking models, data poisoning, and data theft. We will discuss this angle in this post
  • Let's look at the AI pipeline and highlight a few key components
  1. Data Observability: Telemetry from data transformation, quality, the lineage of the data used for training and also monitoring the data in inference
  2. ModelOps, AI "Observability, AI "Assurance", AI" Explainability: These products focus on the model to ensure its performance, explainability, checking for bias, fairness, etc.
No alt text provided for this image
  • Let's look at the attack surface and vectors
  1. Training Data: Attackers poison training data either directly or through the supply chain (e.g. labeling). IMHO this is a very low-efficiency vector for attackers if you use 1st party data. You have to poison a meaningful amount of data to influence the model. For first-party data, you will have to breach the network and insert malicious data. If you breach the network and can access data, maybe a smart hacker will ransom or steal it instead? If you are using publicly available data, that data can be easily poisoned
  2. Model Vulnerabilities: Open source is a very popular starting point for AI/ML models and is similar to open-source middleware and packages (Linux, Python). If GenAI is wildly adopted and the starting point is open-source models, this could be a significant attack vector.
  3. Model Training: This is not a vector but a model that needs to be trained assuming adversarial attacks in production. So the model has to be stress tested during training to avoid failing in production. Even if the model is not explicitly attacked, the model has to be robust against some variance in data quality during inference
  4. Data Poisoning: Feeding malicious data during inference to force the model to fail. IMHO, this attack vector affects exposed models. For e.g., attackers can change speed limit signs and the model can fail causing accidents. If your model is embedded like a loan or risk model, poisoning would be a bit difficult as you will have to reverse engineer the model without getting caught. There are rich academic studies on this topic particularly with CV on how even a few pixel tampering can trip the model. For Gen AI this can also be a problem in terms of prompt injection or prompt engineering. You can force the model to spit sensitive data by prompt engineering.
  5. Model Output: This is a new attack surface/vector with Gen AI due to the popularity of autonomous agents and creating workflows by chaining agents. If you tamper with a previous stage output by injecting a different result you can tamper results down the chain. Prompt injection is the use of maliciously crafted prompts into generative AI to elicit incorrect, inaccurate, and even potentially offensive responses. This could be particularly mettlesome as developers fold ChatGPT and other Large Language Models (LLMs) into their applications so that a user's prompt is crunched by the AI and triggers some other action, such as posting content to a website or crafting automated emails that could potentially include incorrect or incendiary messages. A very good paper here - https://ar5iv.labs.arxiv.org/html/2302.12173
No alt text provided for this image
No alt text provided for this image
No alt text provided for this image

.

  1. Other exotic attack vectors include 1)  Sponge attack. In this attack type, adversaries can essentially conduct a denial of service attack against an AI model by specially crafting input to burn up the model's use of hardware consumption. 2) Model theft - Attackers are most likely to look for straightforward measures such as breaking into private source code repositories through phishing or password attacks to steal models outright.
No alt text provided for this image


  • Pre "GenAI" AI Security:
  1. Model Observability: These tools focused on performance, operationalizing models, passing regulations
  2. AI Security: These tools primarily focused on CV, NLP, and few internal models in regulated industries. It had some overlap with #1 above in terms of "AI Firewall" which overlapped with data observability and "AI continuous testing" which overlapped with ModelOps/AI Observability/Assurance. One unique feature was to stress test models for adversarial attacks.
No alt text provided for this image
  • Let's discuss Gen AI. Pre GenAI, AI was restricted to a few industries and vendors. B2C companies like Google, Facebook, DoorDash, etc had ton of data and invested a lot as it helped drive the business and brought new revenue opportunities. Fintech/Banks also adopted AI models for risk, fraud, and credit scoring. Operationally heavy companies adopted features like anomaly/outlier detection to monitor equipment and processes. Few cybersecurity companies in endpoint security, network monitoring, and IoT security adopted AI to detect anomalies, identify objects and get away from signatures
  • So how will customers use Gen AI? Here are the two models
  1. Employee facing: GenAI will bring in productivity across the enterprise, so I believe most enterprises will adopt GenAI in one or multiple of these consumptions models
  • ChatGPT/Bard/BingAI: These are best-of-breed LLM models (either hosted or SaaS) for use cases such as code, copywriting, marketing
  • Internal "ChatGPT": Consider this as a new employee portal or enterprise search layer. Employees can interact, ask questions, etc.
  • "LLM skin on SaaS": Consider this as the consumption layer on top of SaaS apps. SaaS apps will have their own LLM consumption layer but it might make sense to have a single pane of glass on all. These models will be trained on your own SaaS data
  • Autonomous Agents/ Gen AI workflows: Consider this as RPA 2.0. You can create IT automation or customer service automation. These workflows will be built by chaining multiple agents and maintained by IT
No alt text provided for this image

2. Business facing:

  • LLM skin on SaaS: Consider this as a consumption layer to search/analyze data in the apps. Even if there is no real use case for Gen AI in the product, let's say network firewall, vendors will still add this layer to better use the telemetry for operations. Snowflake is acquiring Neeva for adding search on top of the database so customers get more value from the data
  • Autonomous Agents/Copilots: Consider this as LLM native apps or LLM use cases inside an app like customer support/experience or marketing content generation
No alt text provided for this image
  • Gen AI Security: Now let's address the main topic. How is Gen AI security different from the prev generation (CV, NLP, etc)?
  1. The biggest difference is the exposure of data in Gen AI use cases. For SaaS apps, there is a risk of sensitive data being shared and being used for training. Samsung Data leak is a recent mishap. Vendors will offer privately hosted modules, but customers will still need to monitor data. Other risk is data security, privacy. Consider a hacker who got hold of employee's credential and now with the LLM layers can access any sensitive data. All authorizations break with LLMs and need to be redone. The other risk is labeling, and curating data used for the home-grown trained LLMs
  2. Model vulnerabilities: If enterprises start with open-source models, the vulnerabilities in these models are going to be ongoing issues. Also, hackers can reverse-engineer these models without getting inside the network
  3. What about MLDR? ML Detection and Response is an architecture pioneered by HiddenLayer. The company has origins in Cylance where the team saw hackers attacking the AI model to reverse engineer/break the model. So collecting and analyzing telemetry similar to collecting telemetry from your system (OS) or application logs can detect persistent attacks to the model. This assumes that the hacker is in the system and is avoiding getting caught by the AI model.
  • Components of Gen AI security:
  1. Data/Model Observability, Scanning: You need these traditional components for Gen AI similar to prev models. They will be required for tuning, and operationalizing the model and can be used for security as well. I believe if a company is using only Gen AI, a purpose-built product would be better
No alt text provided for this image

2. Data Layer: This is the net new component for Gen AI security. You need a forward, backward proxy DLP layer to control data leaks. Models will never be perfect and prompt engineering will be expensive. You need a layer in front of these models and also in front of the data these models are trained on

No alt text provided for this image
No alt text provided for this image
  • Landscape:
  1. The previous-gen model observability/modelOps, AI security startups have gotten a second wind in their sails with GenAI. Almost all will offer Gen AI security products
  2. I do think a purpose-built Gen AI solution that combines the capabilities of model observability/Modelops and AI security is better for customers only using Gen AI.
  3. Organizations working on AI models pre GenAI will likely stick with the legacy vendors and use their Gen AI products
No alt text provided for this image

Thank you for sharing, Mr. Gosavi. Gen AI security refers to the application of AI in securing and protecting the evolving landscape of artificial general intelligence (AGI). It involves developing AI systems that can defend against potential threats, ensure the safety and ethical use of AGI, and mitigate risks associated with its deployment. Gen AI security encompasses various areas such as robustness, transparency, privacy, and governance. By leveraging AI techniques, we can proactively address the challenges and risks associated with AGI, ensuring its responsible development and deployment in a secure and beneficial manner for humanity. Anubrain Technology is an AI- based developer working on security & surveillance: https://anubrain.com/ai-in-security-surveillance/.

Abhishek Agarwalla

CEO & Co-founder @ Fabric

1y

Prompt injection poses a genuine threat to language models, and regrettably, there is no foolproof solution to mitigate this risk. This intricate challenge encompasses not only the reputation and ethical considerations of organizations but also the potential for data leakage.

🛡️ Kyle H.

CTO & Co-Founder at PhishCloud Inc.

1y

So we need to protect the data while we teach the AI how to protect itself? Interesting challenge, indeed. 🤔 #GenAI #DataSecurity 

Gagandeep Pahwa

Cybersecurity product management@Salesforce | CISSP | Ex- TAS Manager @Tata Group | CIO NEXT100 2023 Winner

1y

AI code security. To protect against attacks like model poisoning etc, coding has to be done in a secure way. Basic sdlc security (which most cos are already not perfect at), will need to be boosted by AI vulnerability scanning. I foresee it will be a separate class of vulnerabilities requiring separate scanning tools similar to a SAST/ secret scanning tool. In an ideal world, it would have been great to have an open source community create such solutions but it needs to be balanced with keeping the model proprietary and secure. Interesting times ahead!

Itamar Golan

Co-Founder & CEO @ Prompt Security | OWASP Top10 for LLM-based Apps | Hiring DS, SW

1y

Brilliant as always Pramod Gosavi

To view or add a comment, sign in

Insights from the community

Others also viewed

Explore topics