Guides

Best Practices for Automating Your Security Questionnaires Response Process (1/3)

Lalita Hardier
March 27, 2024

Efficiently responding to security questions requires more than just a set of data and someone who uses that data to respond. It requires various elements such as having up-to-date, clean and structured data, knowing how to manage and maintain that data, and being able to collaborate efficiently. 

Therefore, we decided to divide the best-in-class practices for responding to security questionnaires into three chapters in order to cover all aspects. Each chapter will be published weekly including:

  • How to build and structure the right data repository 
  • How to manage and maintain the data over time
  • How to effectively enhance collaboration with teams

Chapter 1: How to Build and Structure the Right Data Repository

Building a knowledge base is crucial as it serves as the central repository of information, allowing the system to efficiently retrieve data needed to respond to security questionnaires through Retrieval-Augmentation-Generation (RAG) for example. This also means that the quality and accuracy of responses generated by AI are determined by how you build and manage your knowledge base. There are many ways to build a knowledge base, but today, the Skypher team will show you how to build the best knowledge base in 2 steps.

Step 1: Document & content provision

When you create a knowledge base from scratch, our team will help you to set it up. We will guide you through the best content selection process. Once we have identified together the right sets of data, Skypher will automatically build the knowledge base based on what you provided. Everything will be labeled and classified so you can make sure people have the right information when they need to confirm a response. Skypher comes pre-built with a powerful security framework that will help you structure your information. It’s not necessary to provide a load of data at the beginning. However, we recommend you to provide main sets of data including:

  • Some previous well filled-out previous questionnaires with up-to-date information
  • Security documents (such as policies, SOC-2 reports, certifications, and white papers)
  • Standard questionnaires CAIQ / SIG

By providing all of this, we can ensure that all data will be sufficient to respond to questionnaires during the POC or the initial stage of the deployment.

Additionally, you can also sync your knowledge base on Skypher with other external data sources, such as Google Drive, and internal or public web pages and wikis, such as an internal wiki. Thus, you don't need to manually move data between platforms and all data on Skypher will be updated automatically whenever changes are made elsewhere.

Step 2: Platform setup

After providing all the documents, the next question might be “What are we going to do with all the data you provided?” The answer is  we’re going to do the setup for you by putting all this data in the knowledge base, create templates* based on your security documents, and make it all ready to be used in responding to questions in security questionnaires. We are able to extract information from all your PDFs and structure it into templates (e.g extract a few lines from a document to respond to a question. All the documents are vectorized in the library making this unstructured data usable and searchable. We use Qdrant to do this. Users can later modify, delete, and add more data later by themselves in the knowledge base.

*A template is a set of a question and answer taken from documents that users provide. It’s used in responding to questions in questionnaires based on the accuracy score. You can update templates when you have better answers. Thus, you can always start responding to questions with the best suggestions.

Skypher will usually attach 20-30 similar questions to just one response as Enterprises usually formulate the questions a bit differently but expect the same response. We have built an entirely automated alias system that helps you to gather all similar questions in one template with one response. All this in order to keep your knowledge base easier to manage and maintain over time.

For the final step in setting up your knowledge base, we’ll define the logic in the data system together with you in order to structure the data. Questions regarding the definition of logic can include:

  • How many products do you want to respond to?: If you have many products, we can help you create labels to separate groups of documents and templates for each product. Thus, all data from different products don’t get mixed up. We also create a Company Wide category that is common to all your products. You can think of all the questions around human resource management that don’t require different answers based on a specific product but are rather common to the whole company. We regroup them in this category in order to avoid duplication of content with product libraries. 

To sum things up, Company Wide is the place where all data is shared with every product. For example, usually one question has a different answer depending on each product. However, for more general questions, they will be kept in a company Wide as they share the same answers for every product. By doing this, you can avoid the duplication of content in every product.

  • What is your deployment model? (on prem or cloud?): We provide you with the right setup and configuration for your specific deployment models. As each model requires different responses regarding hosting or infrastructure for example. So for each product, you can create specific sub-segment for different deployment models. 

 At Skypher, we also have a tagging feature that allows you to tag templates, so you know where or which category these templates belong. For example, you can also divide your data with NDA and Non NDA tags to differentiate the content you share with your customers depending on if they signed an NDA in the process. Moreover, we have an ownership feature that makes sure each user is accountable for maintaining data. (We’ll dive deeper into these essential features in the next article.)

Voila! Now you have the right knowledge base that is ready to be used. You have to always remember that all data used in the knowledge base should be up-to-date and maintaining it is crucial if you want your different teams to trust the software and see the automation results grow over time!

Security Questionnaire automation is only achievable if you get this first step right as otherwise you will only feed the algorithm with false or outdated data. We will cover this in the following parts of this blog post series.

Please feel free to reach out to us here if you need more information regarding the setup of your data. Thank you so much for your time and attention for this article. We hope to see you next week on the topic of “How to manage and maintain the data in the knowledge base.”

Have a good day!

Lalita Hardier
Lalita is our marketing manager. With a diverse background in marketing and international business, she drives our marketing initiatives and innovative strategies to enhance brand visibility and engagement.

Our latest news

Discover the latest news from Skypher whether it is features release, new customer stories, guides or updates

Ready to Scale Your Security Questionnaire Response Process?

Book a Demo