Site Reliability Test

The Site Reliability Engineer (SRE) Test uses scenario-based questions to evaluate knowledge of cloud technologies, system design, automation, and troubleshooting skills. It assesses understanding of infrastructure as code, continuous integration and deployment, and monitoring systems. The test also measures proficiency in scripting languages and hands-on coding for infrastructure problem-solving. It further includes real-world situations to examine critical thinking and incident management abilities.

Covered skills:

System Design and Architecture
Infrastructure as Code (IaC)
Continuous Integration/Continuous Deployment (CI/CD)
Understanding of Networking Concepts
Monitoring and Logging Systems
Incident Management and Post-Mortem Analysis
Performance Tuning and Load Balancing
Database Reliability and Scalability
Understanding of Security Principles
Disaster Recovery Planning and Execution
Microservices and Containerization
Service Level Objectives (SLOs) and Error Budgets
Traffic Management and Distributed Systems
High Availability and Resiliency Strategies
Capacity Planning and Resource Optimization

Test Duration

40 mins

Difficulty Level

Moderate

Questions

5 Site Reliability Engineering MCQs
5 DevOps MCQs
3 Docker MCQs
3 Kubernetes MCQs

Availability

Ready to use

Get started for free

Preview questions

About the Site Reliability Assessment Test

The Site Reliability Test helps recruiters and hiring managers identify qualified candidates from a pool of resumes, and helps in taking objective hiring decisions. It reduces the administrative overhead of interviewing too many candidates and saves time by filtering out unqualified candidates at the first step of the hiring process.

The test screens for the following skills that hiring managers look for in candidates:

Proficient in Site Reliability Engineering practices and principles
Experience in DevOps methodologies and tools
Knowledge of Docker containerization
Understanding of Kubernetes orchestration
Ability to design robust systems and architectures
Familiarity with Infrastructure as Code (IaC) concepts
Expertise in Continuous Integration/Continuous Deployment (CI/CD) pipelines
Comprehension of Networking Concepts in distributed systems
Skill in implementing Monitoring and Logging Systems
Proficient in Incident Management and Post-Mortem Analysis
Experience in Performance Tuning and Load Balancing
Expertise in ensuring Database Reliability and Scalability
Knowledge of Security Principles in system design
Familiarity with Disaster Recovery Planning and Execution
Understanding of Microservices and Containerization
Skill in defining Service Level Objectives (SLOs) and Error Budgets
Knowledge of Traffic Management and Distributed Systems
Expertise in High Availability and Resiliency Strategies
Ability to perform Capacity Planning and Resource Optimization

1200+ customers in 80 countries

Use Adaface tests trusted by recruitment teams globally. Adaface skill assessments measure on-the-job skills of candidates, providing employers with an accurate tool for screening potential hires.

Get started for free

Preview questions

Non-googleable questions

We have a very high focus on the quality of questions that test for on-the-job skills. Every question is non-googleable and we have a very high bar for the level of subject matter experts we onboard to create these questions. We have crawlers to check if any of the questions are leaked online. If/ when a question gets leaked, we get an alert. We change the question for you & let you know.

How we design questions

These are just a small sample from our library of 15,000+ questions. The actual questions on this Site Reliability Test will be non-googleable.

🧐 Question
Medium Error Budget Management Latency Monitoring Error Budgets Distributed Tracing	Solve
You are a site reliability engineer responsible for maintaining a microservices-based e-commerce platform. Your system consists of several independent services, each deployed on its separate container within a Kubernetes cluster. Your organization follows a strict Service Level Objective (SLO) to maintain user satisfaction, which mandates that the 95th percentile latency for all requests over a 30-day period should not exceed 200 ms. The following pseudo-code represents a simplified version of the request processing in your system: You realize that over the first two weeks of the current 30-day window, the 95th percentile latency has risen to 250 ms. Analyzing further, you discover that out of 10 million requests, 600,000 requests took more than 200 ms to complete. Given these facts, which of the following is the most effective course of action that you can take to troubleshoot and reduce the system's latency issues? A: Change the latency log level to debug to gather more information. B: Increase the SLO for latency to 250 ms to accommodate the current system performance. C: Introduce more instances of each microservice to handle the increased load. D: Implement a distributed tracing mechanism to identify the microservices contributing most to the latency. E: Implement request throttling to reduce the overall number of requests.
Medium Incident Response Procedure Incident Management Disaster Recovery System Optimization	Solve
You are an SRE for a large-scale distributed system. The system architecture includes five primary servers (P1 to P5) and three backup servers (B1 to B3). The system uses an advanced load balancer that distributes the workload across the primary servers evenly. One day, the monitoring system triggers an alert that server P5 is not responding. The pseudo-code for the current incident response procedure is as follows: The function 'replaceServer(server)' replaces the failed server with a new one from a pool of spare servers, which takes around 30 minutes. The current discussion revolves around modifying this procedure to improve system resilience and minimize potential downtime. The backup servers are underutilized and could be leveraged more effectively. Also, the load balancer can dynamically shift workloads based on server availability and response time. Based on the situation above, what is the best approach to optimize the incident response procedure? A: Implement an early warning system to predict server failures and prevent them. B: Upon failure detection, immediately divert traffic to backup servers, then attempt to reboot the primary server, and replace if necessary. C: Replace the failed server without attempting a reboot and keep the traffic on primary servers. D: Enable auto-scaling to add more servers when a primary server fails. E: Switch to a more advanced load balancer that can detect and handle server failures independently.
Medium Service Balancer Decision-making Load Balancing Distributed Systems Concurrent Processing	Solve
You are a Site Reliability Engineer (SRE) working on a distributed system with a load balancer that distributes requests across a number of servers based on the current load. The decision algorithm for load balancing is written in pseudo-code as follows: The system receives a large burst of requests. In response to this, some engineers propose increasing the `threshold` value to allow for more requests to be handled concurrently by each server. Others argue that instead, we should increase the number of servers to distribute the load more evenly. Consider that the system has auto-scaling capabilities based on the average load of all servers, but the scaling operation takes about 15 minutes to add new servers to the pool. Also, the servers' performance degrades sharply if the load is much above the threshold. One of the engineers also proposes modifying the getServer function logic to distribute the incoming load one by one across all servers to trigger the average load to rise faster. Based on this scenario, what is the best approach? A: Increase the `threshold` value to allow more requests on each server. B: Add more servers to distribute the load, regardless of the auto-scaling delay. C: Modify the getServer function to distribute the incoming load one by one across all servers to trigger the average load to rise faster. D: Increase the `threshold` and add more servers simultaneously. E: Manually trigger the auto-scaling process before the load increases.
Medium Resource Analysis Process Management System Performance Log Analysis	Solve
As a senior DevOps engineer, you are tasked with diagnosing performance issues on a Linux server running Ubuntu 20.04. The server hosts several critical applications, but lately, users have been experiencing significant slowness. Initial monitoring shows that CPU and memory utilization are consistently high. To identify the root cause, you check the output of `top` and `ps` commands, which indicate that a particular process is consuming an unusually high amount of resources. However, the process name is generic and does not clearly indicate which application or service it belongs to. You also examine `/var/log/syslog` for any unusual entries but find nothing out of the ordinary. Based on this situation, which of the following steps would most effectively help you identify and resolve the performance issue? A: Increase the server's physical memory and CPU capacity. B: Use the `lsof` command to identify the files opened by the suspect process. C: Reboot the server to reset all processes. D: Examine the `/etc/hosts` file for any incorrect configurations. E: Run the `netstat` command to check for abnormal network activity. F: Check the crontab for any recently added scheduled tasks.
Medium Streamlined DevOps Continuous Integration Scripting Git Shell Scripting	Solve
You are in charge of developing a Bash script for setting up a continuous integration pipeline for a web application. The source code is hosted in a Git repository. The script's goals include: 1. Ensuring the local copy of the repository in /var/www/html is updated to the latest version. 2. Creating a .env file with APP_ENV=production in the project root if it doesn't already exist. 3. Running a test suite with ./run_tests.sh and handling any test failures appropriately. 4. Logging the current timestamp and commit hash in deployment_log.txt in the project root if tests pass. Which of the following script options would most effectively and safely accomplish these tasks?
Medium Docker Multistage Build Analysis Multistage Builds Optimization Dockerfile Syntax	Solve
Consider the following Dockerfile, which utilizes multistage builds. The aim is to build a lightweight, optimized image that just runs the application. The Dockerfile first defines a base image that includes Node.js and npm, then it creates an intermediate image to install the npm dependencies. Afterwards, it runs the tests in another stage and finally, creates the release image. Which of the following statements are true? A: The final image will include the test scripts. B: If a test fails, the final image will not be created. C: The node_modules directory in the final image comes from the base image. D: The final image will only contain the necessary application files and dependencies. E: If the application's source code changes, only the release stage needs to be rebuilt.
Easy Docker Networking and Volume Mounting Interplay Networking Volume Mounting Docker Networking	Solve
You have two docker containers, X and Y. Container X is running a web service listening on port 8080, and container Y is supposed to consume this service. Both containers are created from images that don't have any special network configurations. Container X has a Dockerfile as follows: And, you build and run it with the following commands: Container Y is also running alpine with python installed, and it's supposed to read data from the `/app/data` directory and send a GET request to `http://localhost:8080` every 5 minutes. The Dockerfile for container B is: And you run it with: Assuming all the python scripts work perfectly and firewall isn't blocking any connections, you find that container Y can't access the web service of container X via `http://localhost:8080` and also it can't read the data in `/app/data` directory. What could be the potential reason(s)? A: Y can't access X's web service because they're in different Docker networks. B: Y can't read the data because the volume is not shared correctly. C: Both A and B are correct. D: Both A and B are incorrect.
Medium Dockerfile Optimization Dockerfile Multi-stage builds Layer Caching	Solve
You have been asked to optimize a Dockerfile for a Python application that involves a heavy dependency installation. Here is the Dockerfile you are starting with: Given that the application's source code changes frequently but the dependencies listed in requirements.txt rarely change, how can you optimize this Dockerfile to take advantage of Docker's layer caching, reducing the build time? A: Move the `RUN pip install` command to before the `COPY` command. B: Change `COPY . /app` to `COPY ./app.py /app` and move the `RUN pip install` command to before the `COPY` command. C: Add `RUN pip cache purge` before `RUN pip install`. D: Replace the base image with `python:3.8-slim`. E: Implement multi-stage builds.
Medium Dockerfile Updates Cache Docker Cache Strategies	Solve
Check the following Dockerfile used for a project (STAGE 1): We created an image from this Dockerfile on Dec 14 2021. A couple of weeks after Dec 14 2021, Ubuntu released new security updates to their repository. After 2 months, we modified the file (STAGE 2): Couple of weeks later, we further modified the file to add a local file ada.txt to /ada.txt (STAGE 3): (Note that ada.txt exists in /home/adaface and the dockerfile exists in /home/code folders) Pick correct statements: A: If we run “docker build .” at STAGE 2, new Ubuntu updates will be fetched because apt-get update will be run again since cache is invalidated for all lines/layers of Dockerfile when a new line is added. B: If we run “docker build .” at STAGE 2, new Ubuntu updates will not be fetched since cache is invalidated only for last two lines of the updated Dockerfile. Since the first two commands remain the same, cached layers are re-used skipping apt get update. C: To skip Cache, “docker build -no-cache .” can be used at STAGE 2. This will ensure new Ubuntu updates are picked up. D: Docker command “docker build .” at STAGE 3 works as expected and adds local file ada.txt to the image. E: Docker command “docker build .” at STAGE 3 gives an error “no such file or directory” since /home/adaface/ada.txt is not part of the Dockerfile context.
Medium Efficient Dockerfile Dockerfile Dockerfile Syntax Containerization Resource Optimization	Solve
Review the following Dockerfiles that work on two projects (project and project2): All Docker files have the same end result: - ‘project’ is cloned from git. After running few commands, ‘project’ code is removed. - ‘project2’ is copied from file system and permissions to the folder is changed. Pick the correct statements: A: File 1 is the most efficient of all. B: File 2 is the most efficient of all. C: File 3 is the most efficient of all. D: File 4 is the most efficient of all. E: Merging multiple RUN commands into a single RUN command is efficient for ‘project’ since each RUN command creates a new layer with changed files and folders. Deleting files with RUN only marks these files as deleted but does not reclaim disk space. F: Copying ‘project2’ files and changing ownership in two separate commands will result in two layers since Docker duplicates all the files twice.
Medium ConfigMap and Secrets Interaction Resource Management Security Kubernetes Configuration Kubernetes Security	Solve
In a Kubernetes cluster, you are working on configuring a new deployment that should be able to access specific environment variables through both ConfigMap and Secrets resources. The deployment YAML is structured as follows: You have applied the above YAML successfully without any errors. Now, you are about to configure a service to expose the deployment. Before doing that, you want to confirm the security and setup implications. Based on the above configuration, which of the following statements are true? 1. The DATABASE_PASSWORD will be mounted as an environment variable in plain text. 2. The ConfigMap data can be updated and the changes will be reflected automatically in the running pods without any need for a redeployment. 3. If a potential attacker gains access to the cluster, they would be able to retrieve the DATABASE_PASSWORD in plain text from the secrets resource as it is defined in stringData. 4. The APP_ENV and DATABASE_URL values are securely stored and cannot be accessed by non-admin users. 5. If a new container in the same pod is created, it would automatically have the DATABASE_PASSWORD environment variable configured.
Medium Ingress from namespace Network Network Policies Kubernetes Network Policy Yaml Configuration	Solve
You are tasked with deploying a Kubernetes network policy. Here are the specifications: - Name of the policy: adaface-namespace - Policy to be deployed in ‘chatbot’ namespace - The policy should allow ALL traffic only from ‘tester’ namespace - Policy should not allow communication between pods in the same namespace - Traffic only from ‘tester’ namespace is allowed on all ports Which of the following configuration files is BEST suited to create required dependencies and deploy the network policy?
Medium Pod Affinity and Resource Quota Compliance Pod Scheduling Resource Management Kubernetes Deployment	Solve
You are working on a Kubernetes project where you need to ensure that certain pods get scheduled on nodes based on the presence of other pods and to limit the amount of resources that can be consumed in a namespace. You have been given the following YAML file which contains a combination of a pod definition and a resource quota: With the application of the above YAML configuration, assess the validity of the statements and choose the correct option that lists all the true statements. 1. The critical-pod will only be scheduled on nodes where at least one pod with a label security=high is already running. 2. The critical-pod is adhering to the resource quotas defined in the compute-quota. 3. The compute-quota restricts the namespace to only allow a total of 1 CPU and 1Gi memory in requests and 2 CPUs and 2Gi memory in limits across all pods. 4. If a node has multiple pods labeled with security=high, the critical-pod can potentially be scheduled on that node, given other scheduling constraints are met. 5. The critical-pod exceeds the defined memory request quota as per the compute-quota.
Easy Resource limits Pods Containers	Solve
How would you deploy a Kubernetes pod with the following specifications: - Name of pod: adaface - Resource limits: 1 CPU and 512Mi memory - Image: haproxy A: kubectl run adaface --image=haproxy --limits='cpu=1,memory=512Mi' B: kubectl run adaface --image=haproxy --requests='cpu=1,memory=512Mi'

	🧐 Question	🔧 Skill
	Medium Error Budget Management Latency Monitoring Error Budgets Distributed Tracing	3 mins Site Reliability Engineering	Solve
You are a site reliability engineer responsible for maintaining a microservices-based e-commerce platform. Your system consists of several independent services, each deployed on its separate container within a Kubernetes cluster. Your organization follows a strict Service Level Objective (SLO) to maintain user satisfaction, which mandates that the 95th percentile latency for all requests over a 30-day period should not exceed 200 ms. The following pseudo-code represents a simplified version of the request processing in your system: You realize that over the first two weeks of the current 30-day window, the 95th percentile latency has risen to 250 ms. Analyzing further, you discover that out of 10 million requests, 600,000 requests took more than 200 ms to complete. Given these facts, which of the following is the most effective course of action that you can take to troubleshoot and reduce the system's latency issues? A: Change the latency log level to debug to gather more information. B: Increase the SLO for latency to 250 ms to accommodate the current system performance. C: Introduce more instances of each microservice to handle the increased load. D: Implement a distributed tracing mechanism to identify the microservices contributing most to the latency. E: Implement request throttling to reduce the overall number of requests.
	Medium Incident Response Procedure Incident Management Disaster Recovery System Optimization	3 mins Site Reliability Engineering	Solve
You are an SRE for a large-scale distributed system. The system architecture includes five primary servers (P1 to P5) and three backup servers (B1 to B3). The system uses an advanced load balancer that distributes the workload across the primary servers evenly. One day, the monitoring system triggers an alert that server P5 is not responding. The pseudo-code for the current incident response procedure is as follows: The function 'replaceServer(server)' replaces the failed server with a new one from a pool of spare servers, which takes around 30 minutes. The current discussion revolves around modifying this procedure to improve system resilience and minimize potential downtime. The backup servers are underutilized and could be leveraged more effectively. Also, the load balancer can dynamically shift workloads based on server availability and response time. Based on the situation above, what is the best approach to optimize the incident response procedure? A: Implement an early warning system to predict server failures and prevent them. B: Upon failure detection, immediately divert traffic to backup servers, then attempt to reboot the primary server, and replace if necessary. C: Replace the failed server without attempting a reboot and keep the traffic on primary servers. D: Enable auto-scaling to add more servers when a primary server fails. E: Switch to a more advanced load balancer that can detect and handle server failures independently.
	Medium Service Balancer Decision-making Load Balancing Distributed Systems Concurrent Processing	2 mins Site Reliability Engineering	Solve
You are a Site Reliability Engineer (SRE) working on a distributed system with a load balancer that distributes requests across a number of servers based on the current load. The decision algorithm for load balancing is written in pseudo-code as follows: The system receives a large burst of requests. In response to this, some engineers propose increasing the `threshold` value to allow for more requests to be handled concurrently by each server. Others argue that instead, we should increase the number of servers to distribute the load more evenly. Consider that the system has auto-scaling capabilities based on the average load of all servers, but the scaling operation takes about 15 minutes to add new servers to the pool. Also, the servers' performance degrades sharply if the load is much above the threshold. One of the engineers also proposes modifying the getServer function logic to distribute the incoming load one by one across all servers to trigger the average load to rise faster. Based on this scenario, what is the best approach? A: Increase the `threshold` value to allow more requests on each server. B: Add more servers to distribute the load, regardless of the auto-scaling delay. C: Modify the getServer function to distribute the incoming load one by one across all servers to trigger the average load to rise faster. D: Increase the `threshold` and add more servers simultaneously. E: Manually trigger the auto-scaling process before the load increases.
	Medium Resource Analysis Process Management System Performance Log Analysis	3 mins DevOps	Solve
As a senior DevOps engineer, you are tasked with diagnosing performance issues on a Linux server running Ubuntu 20.04. The server hosts several critical applications, but lately, users have been experiencing significant slowness. Initial monitoring shows that CPU and memory utilization are consistently high. To identify the root cause, you check the output of `top` and `ps` commands, which indicate that a particular process is consuming an unusually high amount of resources. However, the process name is generic and does not clearly indicate which application or service it belongs to. You also examine `/var/log/syslog` for any unusual entries but find nothing out of the ordinary. Based on this situation, which of the following steps would most effectively help you identify and resolve the performance issue? A: Increase the server's physical memory and CPU capacity. B: Use the `lsof` command to identify the files opened by the suspect process. C: Reboot the server to reset all processes. D: Examine the `/etc/hosts` file for any incorrect configurations. E: Run the `netstat` command to check for abnormal network activity. F: Check the crontab for any recently added scheduled tasks.
	Medium Streamlined DevOps Continuous Integration Scripting Git Shell Scripting	2 mins DevOps	Solve
You are in charge of developing a Bash script for setting up a continuous integration pipeline for a web application. The source code is hosted in a Git repository. The script's goals include: 1. Ensuring the local copy of the repository in /var/www/html is updated to the latest version. 2. Creating a .env file with APP_ENV=production in the project root if it doesn't already exist. 3. Running a test suite with ./run_tests.sh and handling any test failures appropriately. 4. Logging the current timestamp and commit hash in deployment_log.txt in the project root if tests pass. Which of the following script options would most effectively and safely accomplish these tasks?
	Medium Docker Multistage Build Analysis Multistage Builds Optimization Dockerfile Syntax	3 mins Docker	Solve
Consider the following Dockerfile, which utilizes multistage builds. The aim is to build a lightweight, optimized image that just runs the application. The Dockerfile first defines a base image that includes Node.js and npm, then it creates an intermediate image to install the npm dependencies. Afterwards, it runs the tests in another stage and finally, creates the release image. Which of the following statements are true? A: The final image will include the test scripts. B: If a test fails, the final image will not be created. C: The node_modules directory in the final image comes from the base image. D: The final image will only contain the necessary application files and dependencies. E: If the application's source code changes, only the release stage needs to be rebuilt.
	Easy Docker Networking and Volume Mounting Interplay Networking Volume Mounting Docker Networking	3 mins Docker	Solve
You have two docker containers, X and Y. Container X is running a web service listening on port 8080, and container Y is supposed to consume this service. Both containers are created from images that don't have any special network configurations. Container X has a Dockerfile as follows: And, you build and run it with the following commands: Container Y is also running alpine with python installed, and it's supposed to read data from the `/app/data` directory and send a GET request to `http://localhost:8080` every 5 minutes. The Dockerfile for container B is: And you run it with: Assuming all the python scripts work perfectly and firewall isn't blocking any connections, you find that container Y can't access the web service of container X via `http://localhost:8080` and also it can't read the data in `/app/data` directory. What could be the potential reason(s)? A: Y can't access X's web service because they're in different Docker networks. B: Y can't read the data because the volume is not shared correctly. C: Both A and B are correct. D: Both A and B are incorrect.
	Medium Dockerfile Optimization Dockerfile Multi-stage builds Layer Caching	2 mins Docker	Solve
You have been asked to optimize a Dockerfile for a Python application that involves a heavy dependency installation. Here is the Dockerfile you are starting with: Given that the application's source code changes frequently but the dependencies listed in requirements.txt rarely change, how can you optimize this Dockerfile to take advantage of Docker's layer caching, reducing the build time? A: Move the `RUN pip install` command to before the `COPY` command. B: Change `COPY . /app` to `COPY ./app.py /app` and move the `RUN pip install` command to before the `COPY` command. C: Add `RUN pip cache purge` before `RUN pip install`. D: Replace the base image with `python:3.8-slim`. E: Implement multi-stage builds.
	Medium Dockerfile Updates Cache Docker Cache Strategies	2 mins Docker	Solve
Check the following Dockerfile used for a project (STAGE 1): We created an image from this Dockerfile on Dec 14 2021. A couple of weeks after Dec 14 2021, Ubuntu released new security updates to their repository. After 2 months, we modified the file (STAGE 2): Couple of weeks later, we further modified the file to add a local file ada.txt to /ada.txt (STAGE 3): (Note that ada.txt exists in /home/adaface and the dockerfile exists in /home/code folders) Pick correct statements: A: If we run “docker build .” at STAGE 2, new Ubuntu updates will be fetched because apt-get update will be run again since cache is invalidated for all lines/layers of Dockerfile when a new line is added. B: If we run “docker build .” at STAGE 2, new Ubuntu updates will not be fetched since cache is invalidated only for last two lines of the updated Dockerfile. Since the first two commands remain the same, cached layers are re-used skipping apt get update. C: To skip Cache, “docker build -no-cache .” can be used at STAGE 2. This will ensure new Ubuntu updates are picked up. D: Docker command “docker build .” at STAGE 3 works as expected and adds local file ada.txt to the image. E: Docker command “docker build .” at STAGE 3 gives an error “no such file or directory” since /home/adaface/ada.txt is not part of the Dockerfile context.
	Medium Efficient Dockerfile Dockerfile Dockerfile Syntax Containerization Resource Optimization	2 mins Docker	Solve
Review the following Dockerfiles that work on two projects (project and project2): All Docker files have the same end result: - ‘project’ is cloned from git. After running few commands, ‘project’ code is removed. - ‘project2’ is copied from file system and permissions to the folder is changed. Pick the correct statements: A: File 1 is the most efficient of all. B: File 2 is the most efficient of all. C: File 3 is the most efficient of all. D: File 4 is the most efficient of all. E: Merging multiple RUN commands into a single RUN command is efficient for ‘project’ since each RUN command creates a new layer with changed files and folders. Deleting files with RUN only marks these files as deleted but does not reclaim disk space. F: Copying ‘project2’ files and changing ownership in two separate commands will result in two layers since Docker duplicates all the files twice.
	Medium ConfigMap and Secrets Interaction Resource Management Security Kubernetes Configuration Kubernetes Security	2 mins Kubernetes	Solve
In a Kubernetes cluster, you are working on configuring a new deployment that should be able to access specific environment variables through both ConfigMap and Secrets resources. The deployment YAML is structured as follows: You have applied the above YAML successfully without any errors. Now, you are about to configure a service to expose the deployment. Before doing that, you want to confirm the security and setup implications. Based on the above configuration, which of the following statements are true? 1. The DATABASE_PASSWORD will be mounted as an environment variable in plain text. 2. The ConfigMap data can be updated and the changes will be reflected automatically in the running pods without any need for a redeployment. 3. If a potential attacker gains access to the cluster, they would be able to retrieve the DATABASE_PASSWORD in plain text from the secrets resource as it is defined in stringData. 4. The APP_ENV and DATABASE_URL values are securely stored and cannot be accessed by non-admin users. 5. If a new container in the same pod is created, it would automatically have the DATABASE_PASSWORD environment variable configured.
	Medium Ingress from namespace Network Network Policies Kubernetes Network Policy Yaml Configuration	3 mins Kubernetes	Solve
You are tasked with deploying a Kubernetes network policy. Here are the specifications: - Name of the policy: adaface-namespace - Policy to be deployed in ‘chatbot’ namespace - The policy should allow ALL traffic only from ‘tester’ namespace - Policy should not allow communication between pods in the same namespace - Traffic only from ‘tester’ namespace is allowed on all ports Which of the following configuration files is BEST suited to create required dependencies and deploy the network policy?
	Medium Pod Affinity and Resource Quota Compliance Pod Scheduling Resource Management Kubernetes Deployment	2 mins Kubernetes	Solve
You are working on a Kubernetes project where you need to ensure that certain pods get scheduled on nodes based on the presence of other pods and to limit the amount of resources that can be consumed in a namespace. You have been given the following YAML file which contains a combination of a pod definition and a resource quota: With the application of the above YAML configuration, assess the validity of the statements and choose the correct option that lists all the true statements. 1. The critical-pod will only be scheduled on nodes where at least one pod with a label security=high is already running. 2. The critical-pod is adhering to the resource quotas defined in the compute-quota. 3. The compute-quota restricts the namespace to only allow a total of 1 CPU and 1Gi memory in requests and 2 CPUs and 2Gi memory in limits across all pods. 4. If a node has multiple pods labeled with security=high, the critical-pod can potentially be scheduled on that node, given other scheduling constraints are met. 5. The critical-pod exceeds the defined memory request quota as per the compute-quota.
	Easy Resource limits Pods Containers	3 mins Kubernetes	Solve
How would you deploy a Kubernetes pod with the following specifications: - Name of pod: adaface - Resource limits: 1 CPU and 512Mi memory - Image: haproxy A: kubectl run adaface --image=haproxy --limits='cpu=1,memory=512Mi' B: kubectl run adaface --image=haproxy --requests='cpu=1,memory=512Mi'

	🧐 Question	🔧 Skill	💪 Difficulty	⌛ Time
	Error Budget Management Latency Monitoring Error Budgets Distributed Tracing	Site Reliability Engineering	Medium	3 mins	Solve
You are a site reliability engineer responsible for maintaining a microservices-based e-commerce platform. Your system consists of several independent services, each deployed on its separate container within a Kubernetes cluster. Your organization follows a strict Service Level Objective (SLO) to maintain user satisfaction, which mandates that the 95th percentile latency for all requests over a 30-day period should not exceed 200 ms. The following pseudo-code represents a simplified version of the request processing in your system: You realize that over the first two weeks of the current 30-day window, the 95th percentile latency has risen to 250 ms. Analyzing further, you discover that out of 10 million requests, 600,000 requests took more than 200 ms to complete. Given these facts, which of the following is the most effective course of action that you can take to troubleshoot and reduce the system's latency issues? A: Change the latency log level to debug to gather more information. B: Increase the SLO for latency to 250 ms to accommodate the current system performance. C: Introduce more instances of each microservice to handle the increased load. D: Implement a distributed tracing mechanism to identify the microservices contributing most to the latency. E: Implement request throttling to reduce the overall number of requests.
	Incident Response Procedure Incident Management Disaster Recovery System Optimization	Site Reliability Engineering	Medium	3 mins	Solve
You are an SRE for a large-scale distributed system. The system architecture includes five primary servers (P1 to P5) and three backup servers (B1 to B3). The system uses an advanced load balancer that distributes the workload across the primary servers evenly. One day, the monitoring system triggers an alert that server P5 is not responding. The pseudo-code for the current incident response procedure is as follows: The function 'replaceServer(server)' replaces the failed server with a new one from a pool of spare servers, which takes around 30 minutes. The current discussion revolves around modifying this procedure to improve system resilience and minimize potential downtime. The backup servers are underutilized and could be leveraged more effectively. Also, the load balancer can dynamically shift workloads based on server availability and response time. Based on the situation above, what is the best approach to optimize the incident response procedure? A: Implement an early warning system to predict server failures and prevent them. B: Upon failure detection, immediately divert traffic to backup servers, then attempt to reboot the primary server, and replace if necessary. C: Replace the failed server without attempting a reboot and keep the traffic on primary servers. D: Enable auto-scaling to add more servers when a primary server fails. E: Switch to a more advanced load balancer that can detect and handle server failures independently.
	Service Balancer Decision-making Load Balancing Distributed Systems Concurrent Processing	Site Reliability Engineering	Medium	2 mins	Solve
You are a Site Reliability Engineer (SRE) working on a distributed system with a load balancer that distributes requests across a number of servers based on the current load. The decision algorithm for load balancing is written in pseudo-code as follows: The system receives a large burst of requests. In response to this, some engineers propose increasing the `threshold` value to allow for more requests to be handled concurrently by each server. Others argue that instead, we should increase the number of servers to distribute the load more evenly. Consider that the system has auto-scaling capabilities based on the average load of all servers, but the scaling operation takes about 15 minutes to add new servers to the pool. Also, the servers' performance degrades sharply if the load is much above the threshold. One of the engineers also proposes modifying the getServer function logic to distribute the incoming load one by one across all servers to trigger the average load to rise faster. Based on this scenario, what is the best approach? A: Increase the `threshold` value to allow more requests on each server. B: Add more servers to distribute the load, regardless of the auto-scaling delay. C: Modify the getServer function to distribute the incoming load one by one across all servers to trigger the average load to rise faster. D: Increase the `threshold` and add more servers simultaneously. E: Manually trigger the auto-scaling process before the load increases.
	Resource Analysis Process Management System Performance Log Analysis	DevOps	Medium	3 mins	Solve
As a senior DevOps engineer, you are tasked with diagnosing performance issues on a Linux server running Ubuntu 20.04. The server hosts several critical applications, but lately, users have been experiencing significant slowness. Initial monitoring shows that CPU and memory utilization are consistently high. To identify the root cause, you check the output of `top` and `ps` commands, which indicate that a particular process is consuming an unusually high amount of resources. However, the process name is generic and does not clearly indicate which application or service it belongs to. You also examine `/var/log/syslog` for any unusual entries but find nothing out of the ordinary. Based on this situation, which of the following steps would most effectively help you identify and resolve the performance issue? A: Increase the server's physical memory and CPU capacity. B: Use the `lsof` command to identify the files opened by the suspect process. C: Reboot the server to reset all processes. D: Examine the `/etc/hosts` file for any incorrect configurations. E: Run the `netstat` command to check for abnormal network activity. F: Check the crontab for any recently added scheduled tasks.
	Streamlined DevOps Continuous Integration Scripting Git Shell Scripting	DevOps	Medium	2 mins	Solve
You are in charge of developing a Bash script for setting up a continuous integration pipeline for a web application. The source code is hosted in a Git repository. The script's goals include: 1. Ensuring the local copy of the repository in /var/www/html is updated to the latest version. 2. Creating a .env file with APP_ENV=production in the project root if it doesn't already exist. 3. Running a test suite with ./run_tests.sh and handling any test failures appropriately. 4. Logging the current timestamp and commit hash in deployment_log.txt in the project root if tests pass. Which of the following script options would most effectively and safely accomplish these tasks?
	Docker Multistage Build Analysis Multistage Builds Optimization Dockerfile Syntax	Docker	Medium	3 mins	Solve
Consider the following Dockerfile, which utilizes multistage builds. The aim is to build a lightweight, optimized image that just runs the application. The Dockerfile first defines a base image that includes Node.js and npm, then it creates an intermediate image to install the npm dependencies. Afterwards, it runs the tests in another stage and finally, creates the release image. Which of the following statements are true? A: The final image will include the test scripts. B: If a test fails, the final image will not be created. C: The node_modules directory in the final image comes from the base image. D: The final image will only contain the necessary application files and dependencies. E: If the application's source code changes, only the release stage needs to be rebuilt.
	Docker Networking and Volume Mounting Interplay Networking Volume Mounting Docker Networking	Docker	Easy	3 mins	Solve
You have two docker containers, X and Y. Container X is running a web service listening on port 8080, and container Y is supposed to consume this service. Both containers are created from images that don't have any special network configurations. Container X has a Dockerfile as follows: And, you build and run it with the following commands: Container Y is also running alpine with python installed, and it's supposed to read data from the `/app/data` directory and send a GET request to `http://localhost:8080` every 5 minutes. The Dockerfile for container B is: And you run it with: Assuming all the python scripts work perfectly and firewall isn't blocking any connections, you find that container Y can't access the web service of container X via `http://localhost:8080` and also it can't read the data in `/app/data` directory. What could be the potential reason(s)? A: Y can't access X's web service because they're in different Docker networks. B: Y can't read the data because the volume is not shared correctly. C: Both A and B are correct. D: Both A and B are incorrect.
	Dockerfile Optimization Dockerfile Multi-stage builds Layer Caching	Docker	Medium	2 mins	Solve
You have been asked to optimize a Dockerfile for a Python application that involves a heavy dependency installation. Here is the Dockerfile you are starting with: Given that the application's source code changes frequently but the dependencies listed in requirements.txt rarely change, how can you optimize this Dockerfile to take advantage of Docker's layer caching, reducing the build time? A: Move the `RUN pip install` command to before the `COPY` command. B: Change `COPY . /app` to `COPY ./app.py /app` and move the `RUN pip install` command to before the `COPY` command. C: Add `RUN pip cache purge` before `RUN pip install`. D: Replace the base image with `python:3.8-slim`. E: Implement multi-stage builds.
	Dockerfile Updates Cache Docker Cache Strategies	Docker	Medium	2 mins	Solve
Check the following Dockerfile used for a project (STAGE 1): We created an image from this Dockerfile on Dec 14 2021. A couple of weeks after Dec 14 2021, Ubuntu released new security updates to their repository. After 2 months, we modified the file (STAGE 2): Couple of weeks later, we further modified the file to add a local file ada.txt to /ada.txt (STAGE 3): (Note that ada.txt exists in /home/adaface and the dockerfile exists in /home/code folders) Pick correct statements: A: If we run “docker build .” at STAGE 2, new Ubuntu updates will be fetched because apt-get update will be run again since cache is invalidated for all lines/layers of Dockerfile when a new line is added. B: If we run “docker build .” at STAGE 2, new Ubuntu updates will not be fetched since cache is invalidated only for last two lines of the updated Dockerfile. Since the first two commands remain the same, cached layers are re-used skipping apt get update. C: To skip Cache, “docker build -no-cache .” can be used at STAGE 2. This will ensure new Ubuntu updates are picked up. D: Docker command “docker build .” at STAGE 3 works as expected and adds local file ada.txt to the image. E: Docker command “docker build .” at STAGE 3 gives an error “no such file or directory” since /home/adaface/ada.txt is not part of the Dockerfile context.
	Efficient Dockerfile Dockerfile Dockerfile Syntax Containerization Resource Optimization	Docker	Medium	2 mins	Solve
Review the following Dockerfiles that work on two projects (project and project2): All Docker files have the same end result: - ‘project’ is cloned from git. After running few commands, ‘project’ code is removed. - ‘project2’ is copied from file system and permissions to the folder is changed. Pick the correct statements: A: File 1 is the most efficient of all. B: File 2 is the most efficient of all. C: File 3 is the most efficient of all. D: File 4 is the most efficient of all. E: Merging multiple RUN commands into a single RUN command is efficient for ‘project’ since each RUN command creates a new layer with changed files and folders. Deleting files with RUN only marks these files as deleted but does not reclaim disk space. F: Copying ‘project2’ files and changing ownership in two separate commands will result in two layers since Docker duplicates all the files twice.
	ConfigMap and Secrets Interaction Resource Management Security Kubernetes Configuration Kubernetes Security	Kubernetes	Medium	2 mins	Solve
In a Kubernetes cluster, you are working on configuring a new deployment that should be able to access specific environment variables through both ConfigMap and Secrets resources. The deployment YAML is structured as follows: You have applied the above YAML successfully without any errors. Now, you are about to configure a service to expose the deployment. Before doing that, you want to confirm the security and setup implications. Based on the above configuration, which of the following statements are true? 1. The DATABASE_PASSWORD will be mounted as an environment variable in plain text. 2. The ConfigMap data can be updated and the changes will be reflected automatically in the running pods without any need for a redeployment. 3. If a potential attacker gains access to the cluster, they would be able to retrieve the DATABASE_PASSWORD in plain text from the secrets resource as it is defined in stringData. 4. The APP_ENV and DATABASE_URL values are securely stored and cannot be accessed by non-admin users. 5. If a new container in the same pod is created, it would automatically have the DATABASE_PASSWORD environment variable configured.
	Ingress from namespace Network Network Policies Kubernetes Network Policy Yaml Configuration	Kubernetes	Medium	3 mins	Solve
You are tasked with deploying a Kubernetes network policy. Here are the specifications: - Name of the policy: adaface-namespace - Policy to be deployed in ‘chatbot’ namespace - The policy should allow ALL traffic only from ‘tester’ namespace - Policy should not allow communication between pods in the same namespace - Traffic only from ‘tester’ namespace is allowed on all ports Which of the following configuration files is BEST suited to create required dependencies and deploy the network policy?
	Pod Affinity and Resource Quota Compliance Pod Scheduling Resource Management Kubernetes Deployment	Kubernetes	Medium	2 mins	Solve
You are working on a Kubernetes project where you need to ensure that certain pods get scheduled on nodes based on the presence of other pods and to limit the amount of resources that can be consumed in a namespace. You have been given the following YAML file which contains a combination of a pod definition and a resource quota: With the application of the above YAML configuration, assess the validity of the statements and choose the correct option that lists all the true statements. 1. The critical-pod will only be scheduled on nodes where at least one pod with a label security=high is already running. 2. The critical-pod is adhering to the resource quotas defined in the compute-quota. 3. The compute-quota restricts the namespace to only allow a total of 1 CPU and 1Gi memory in requests and 2 CPUs and 2Gi memory in limits across all pods. 4. If a node has multiple pods labeled with security=high, the critical-pod can potentially be scheduled on that node, given other scheduling constraints are met. 5. The critical-pod exceeds the defined memory request quota as per the compute-quota.
	Resource limits Pods Containers	Kubernetes	Easy	3 mins	Solve
How would you deploy a Kubernetes pod with the following specifications: - Name of pod: adaface - Resource limits: 1 CPU and 512Mi memory - Image: haproxy A: kubectl run adaface --image=haproxy --limits='cpu=1,memory=512Mi' B: kubectl run adaface --image=haproxy --requests='cpu=1,memory=512Mi'

Get started for free

Preview questions

With Adaface, we were able to optimise our initial screening process by upwards of 75%, freeing up precious time for both hiring managers and our talent acquisition team alike!

Brandon Lee, Head of People, Love, Bonito

It's very easy to share assessments with candidates and for candidates to use. We get good feedback from candidates about completing the tests. Adaface are very responsive and friendly to deal with.

Kirsty Wood, Human Resources, WillyWeather

We were able to close 106 positions in a record time of 45 days! Adaface enables us to conduct aptitude and psychometric assessments seamlessly. My hiring managers have never been happier with the quality of candidates shortlisted.

Amit Kataria, CHRO, Hanu

We evaluated several of their competitors and found Adaface to be the most compelling. Great library of questions that are designed to test for fit rather than memorization of algorithms.

Swayam Narain, CTO, Affable

Why you should use Pre-employment Site Reliability Test?

The Site Reliability Test makes use of scenario-based questions to test for on-the-job skills as opposed to theoretical knowledge, ensuring that candidates who do well on this screening test have the relavant skills. The questions are designed to covered following on-the-job aspects:

Understanding of system design and architecture principles
Proficiency in Infrastructure as Code (IaC)
Experience with continuous integration/continuous deployment (CI/CD) tools and processes
Knowledge of networking concepts and protocols
Familiarity with monitoring and logging systems
Ability to handle incident management and perform post-mortem analysis
Experience in performance tuning and load balancing
Understanding of database reliability and scalability
Knowledge of security principles and best practices
Proficiency in disaster recovery planning and execution

Once the test is sent to a candidate, the candidate receives a link in email to take the test. For each candidate, you will receive a detailed report with skills breakdown and benchmarks to shortlist the top candidates from your pool.

What topics are covered in the Site Reliability Test?

System Design and Architecture: This skill assesses the candidate's ability to design and architect complex systems, considering factors like scalability, availability, and performance. It is crucial to measure this skill in the test as it forms the foundation for building reliable and efficient software infrastructures.

Infrastructure as Code (IaC): This skill evaluates the candidate's proficiency in using tools and techniques to define and manage infrastructure through code. By measuring this skill, we can ensure that the candidate is capable of automating infrastructure provisioning and maintaining consistency in configuration, leading to increased operational efficiency and reducing manual errors.

Continuous Integration/Continuous Deployment (CI/CD): This skill measures the candidate's understanding and application of automated processes for building, testing, and deploying software. It is essential to assess this skill as it enables organizations to release software rapidly and frequently, ensuring that changes are thoroughly tested, minimizing potential issues, and achieving faster time-to-market.

Understanding of Networking Concepts: This skill assesses the candidate's knowledge of networking fundamentals, including TCP/IP, DNS, routing, and network protocols. It is crucial to measure this skill to ensure the candidate can design and troubleshoot network configurations, optimize network performance, and implement secure and reliable communication between different components of the system.

Monitoring and Logging Systems: This skill evaluates the candidate's ability to implement and utilize monitoring and logging systems to gain insights into application performance, detect issues, and troubleshoot problems. Measuring this skill helps in ensuring proper observability of the system, facilitating proactive monitoring, efficient debugging, and continuous improvement of the overall reliability of the infrastructure.

Incident Management and Post-Mortem Analysis: This skill measures the candidate's knowledge and experience in handling incidents, coordinating response efforts, and conducting post-mortem analysis to identify root causes and prevent recurrence. Assessing this skill is essential as it demonstrates the candidate's ability to effectively manage and mitigate the impact of incidents, improve system reliability, and implement necessary corrective measures to avoid similar incidents in the future.

Performance Tuning and Load Balancing: This skill evaluates the candidate's expertise in optimizing system performance and distributing workload across multiple resources to ensure scalability and high availability. Measuring this skill is crucial as it enables organizations to deliver responsive applications and handle increased traffic without compromising performance, thus ensuring a smooth user experience and minimal downtime.

Database Reliability and Scalability: This skill assesses the candidate's understanding of database technologies, their reliability, and scalability aspects. Measuring this skill is important as it helps ensure that the candidate can design, monitor, and optimize database systems, enabling efficient data storage, retrieval, and high availability while maintaining data integrity and performance.

Understanding of Security Principles: This skill measures the candidate's grasp of security concepts and best practices, including authentication, authorization, encryption, and vulnerability management. Assessing this skill is crucial as it allows organizations to safeguard their systems and data against unauthorized access, maintain compliance with regulatory requirements, and protect sensitive information from potential threats and attacks.

Disaster Recovery Planning and Execution: This skill evaluates the candidate's ability to develop and implement plans for disaster recovery, ensuring business continuity in case of catastrophic events. Measuring this skill is important as it demonstrates the candidate's capability to minimize downtime, recover data and infrastructure, and restore services quickly, effectively reducing the impact of disruptions on the organization.

Microservices and Containerization: This skill assesses the candidate's understanding and proficiency in designing and implementing microservices architectures and utilizing containerization technologies such as Docker and Kubernetes. Measuring this skill is valuable as it allows organizations to build scalable, decoupled, and manageable systems that can be deployed and operated efficiently, enabling rapid development, deployment, and scalability of services.

Service Level Objectives (SLOs) and Error Budgets: This skill measures the candidate's knowledge and application of defining, tracking, and meeting service level objectives, as well as managing error budgets. Assessing this skill is essential as it helps organizations establish and maintain service reliability, make data-driven decisions about feature development and infrastructure investments, and prioritize efforts for improving system performance and availability.

Traffic Management and Distributed Systems: This skill evaluates the candidate's ability to manage and distribute incoming traffic efficiently across multiple resources in distributed systems. Measuring this skill is crucial as it enables organizations to handle high traffic loads, improve system performance, and ensure fault tolerance and scalability, resulting in a better user experience and increased system reliability.

High Availability and Resiliency Strategies: This skill assesses the candidate's knowledge and application of strategies and techniques for achieving high availability and ensuring system resiliency against failures. Measuring this skill is important as it enables organizations to minimize the impact of outages, maintain continuous service availability, and provide an uninterrupted user experience even in the face of unexpected circumstances or component failures.

Capacity Planning and Resource Optimization: This skill measures the candidate's ability to analyze system capacity requirements, optimize resource allocation, and plan for future growth. Assessing this skill is crucial as it enables organizations to effectively manage infrastructure costs, avoid performance bottlenecks or resource shortages, and ensure optimal utilization of resources, leading to efficient and cost-effective operations.

Full list of covered topics

The actual topics of the questions in the final test will depend on your job description and requirements. However, here's a list of topics you can expect the questions for Site Reliability Test to be based on.

Site Reliability Engineering

DevOps Methodologies

Docker

Kubernetes

System Design

Infrastructure as Code

Continuous Integration

Continuous Deployment

Networking Concepts

Monitoring Systems

Logging Systems

Incident Management

Post-Mortem Analysis

Performance Tuning

Load Balancing

Database Reliability

Database Scalability

Security Principles

Disaster Recovery Planning

Disaster Recovery Execution

Microservices

Containerization

Service Level Objectives

Error Budgets

Traffic Management

Distributed Systems

High Availability

Resiliency Strategies

Capacity Planning

Resource Optimization

What roles can I use the Site Reliability Test for?

Site Reliability Engineer (SRE)
Junior Site Reliability Engineer
Senior Site Reliability Engineer

How is the Site Reliability Test customized for senior candidates?

For intermediate/ experienced candidates, we customize the assessment questions to include advanced topics and increase the difficulty level of the questions. This might include adding questions on topics like

Understanding of microservices and containerization techniques
Capability in defining service level objectives (SLOs) and error budgets
Knowledge of traffic management and distributed systems
Expertise in high availability and resiliency strategies
Experience in capacity planning and resource optimization
Ability to troubleshoot and debug complex issues
Proficiency in scripting and automation
Knowledge of cloud platforms and services
Expertise in virtualization technologies
Understanding of version control systems and Git

Preview this test

View sample scorecard

Try the most advanced candidate assessment platform

AI Cheating Detection with Honestly

ChatGPT Protection

Non-googleable Questions

Web Proctoring

IP Proctoring

Webcam Proctoring

MCQ Questions

Coding Questions

Typing Questions

Personality Questions

Custom Questions

Ready-to-use Tests

Custom Tests

Custom Branding

Bulk Invites

Public Links

ATS Integrations

Multiple Question Sets

Custom API integrations

Role-based Access

Priority Support

GDPR Compliance

Screen candidates in 3 easy steps

Pick a test from over 500+ tests

The Adaface test library features 500+ tests to enable you to test candidates on all popular skills- everything from programming languages, software frameworks, devops, logical reasoning, abstract reasoning, critical thinking, fluid intelligence, content marketing, talent acquisition, customer service, accounting, product management, sales and more.

Invite your candidates with 2-clicks

Make informed hiring decisions

Get started for free

Preview questions

Have questions about the Site Reliability Hiring Test?

What is Site Reliability Test?

The Site Reliability Test evaluates candidates' skills in site reliability engineering, focusing on system design, infrastructure, CI/CD, and related areas. It's used by recruiters to find candidates proficient in maintaining reliable and scalable systems.

Can I combine Site Reliability Test with AWS questions?

Yes, you can create a custom test that includes AWS questions. We also offer a dedicated AWS DevOps Test to assess AWS skills in detail.

What skills are assessed in senior roles for the Site Reliability Test?

For senior roles, the test evaluates:

Microservices and containerization techniques
Service level objectives and error budgets
Traffic management and distributed systems
Resiliency strategies
Capacity planning and resource optimization

How to use Site Reliability Test in my hiring process?

Use our assessment as a pre-screening tool at the start of your hiring process. Add a link to the test in your job post or invite candidates via email.

What are the main DevOps tests?

The main DevOps tests include:

Can I combine multiple skills into one custom assessment?

Yes, absolutely. Custom assessments are set up based on your job description, and will include questions on all must-have skills you specify. Here's a quick guide on how you can request a custom test.

Do you have any anti-cheating or proctoring features in place?

We have the following anti-cheating features in place:

Hidden AI Tools Detection with Honestly
Non-googleable questions
IP proctoring
Screen proctoring
Web proctoring
Webcam proctoring
Plagiarism detection
Secure browser
Copy paste protection

Read more about the proctoring features.

How do I interpret test scores?

The primary thing to keep in mind is that an assessment is an elimination tool, not a selection tool. A skills assessment is optimized to help you eliminate candidates who are not technically qualified for the role, it is not optimized to help you find the best candidate for the role. So the ideal way to use an assessment is to decide a threshold score (typically 55%, we help you benchmark) and invite all candidates who score above the threshold for the next rounds of interview.

What experience level can I use this test for?

Each Adaface assessment is customized to your job description/ ideal candidate persona (our subject matter experts will pick the right questions for your assessment from our library of 10000+ questions). This assessment can be customized for any experience level.

Does every candidate get the same questions?

Yes, it makes it much easier for you to compare candidates. Options for MCQ questions and the order of questions are randomized. We have anti-cheating/ proctoring features in place. In our enterprise plan, we also have the option to create multiple versions of the same assessment with questions of similar difficulty levels.

I'm a candidate. Can I try a practice test?

No. Unfortunately, we do not support practice tests at the moment. However, you can use our sample questions for practice.

What is the cost of using this test?

You can check out our pricing plans.

Can I get a free trial?

Yes, you can sign up for free and preview this test.

I just moved to a paid plan. How can I request a custom assessment?

Here is a quick guide on how to request a custom assessment on Adaface.

View sample scorecard

Along with scorecards that report the performance of the candidate in detail, you also receive a comparative analysis against the company average and industry standards.

View sample scorecard