Sign In
Sign In

Top Kubernetes Interview Questions and Answers

Top Kubernetes Interview Questions and Answers
Hostman Team
Technical writer
Infrastructure

In today's tech landscape, the Kubernetes container orchestration platform is widely used across various projects. With its increasing popularity and widespread adoption, Kubernetes often comes up during interviews for certain IT roles, including DevOps, SRE, system administration, development, and operations. The questions can range from very simple ones about cluster components to more advanced topics like networking within the cluster and network policies. In this article, we’ll go over the top Kubernetes interview questions and provide detailed answers.

What is Kubernetes?

Kubernetes is an open-source platform for managing containerized applications. It enables the deployment, scaling, and management of containerized workloads and services.

List the Main Components of a Kubernetes Cluster

At the core of Kubernetes lies the Control Plane, which resides on the master node. The Control Plane includes the following components:

  • kube-api-server – The API server processes REST requests and serves as the "brain" of the cluster. All interactions, including object creation and deletion, go through the API server, which also manages communication between cluster components.
  • etcd – A highly available key-value store that saves configuration data and cluster state. It can be deployed externally for improved fault tolerance. etcd is an independent project maintained by a separate team.
  • kube-scheduler – The component responsible for determining which nodes will run which pods. It monitors available resources on each node to balance workload distribution.
  • kube-controller-manager – Runs controllers that monitor resources and ensure the cluster matches the desired state by making necessary changes.
  • kube-proxy – A network service that acts as a load balancer. It distributes network traffic between pods and runs on every node in the cluster.

What is a Pod in Kubernetes?

A Pod is the smallest deployable unit in Kubernetes and serves as an abstraction for running containers. A pod usually contains one or more containers, its own IP address, and data storage. Kubernetes doesn’t interact directly with containers, but rather through pods.

What is the difference between Deployment and StatefulSet?

Both Deployment and StatefulSet are Kubernetes objects for managing applications, but they serve different purposes.

Deployment:

  • Used for managing stateless applications (e.g., web servers).
  • Supports rolling updates for zero-downtime deployments.
  • Pods are ephemeral with non-persistent names and IPs.
  • No state persistence: when a pod is deleted, its data is lost.

StatefulSet:

  • Designed for stateful applications (e.g., databases).
  • Pods have stable, unique names and identifiers that persist across restarts.
  • Supports Persistent Volumes to retain data between restarts.
  • Pods are created and terminated in a specific order, one at a time.

In conclusion, data persistence is the main difference between a Deployment and a StatefulSet. Use Deployment if the application does not require state to be preserved. However, if the application needs to retain its state, then a StatefulSet is the appropriate choice.

What is a Service in Kubernetes, and What are the Types?

A Service in Kubernetes defines how to access a set of pods. It provides a stable IP and DNS name, allowing internal or external communication with pods.

Types of Services:

  1. ClusterIP – The default type. Exposes the service on an internal IP, accessible only within the cluster.
  2. NodePort – Exposes the service on a specific port across all nodes. Allows external access via NodeIP:NodePort.
  3. LoadBalancer – Provisions an external load balancer (mainly in cloud environments) and assigns a public IP for external traffic distribution.
  4. ExternalName – Maps the service name to an external hostname or IP address using a DNS CNAME record. Works purely at the DNS level.

What is Ingress in Kubernetes?

Ingress is a Kubernetes object that defines rules for routing external HTTP/HTTPS traffic to internal services within the cluster. It enables fine-grained control over how traffic is handled and directed.

What is an Ingress Controller?

An Ingress Controller is a component that implements the Ingress rules. It typically consists of:

  • A reverse proxy (e.g., Nginx, HAProxy)
  • A controller that interacts with the Kubernetes API server to apply Ingress configuration and routing rules.

The controller watches for changes to Ingress objects and configures the reverse proxy accordingly to handle incoming traffic.

How to Store Sensitive Data (Secrets), Including Logins, Passwords, Tokens, and Keys?

Kubernetes provides the Secret object for storing sensitive information. There are six types of secrets:

  1. Opaque – A general-purpose secret type used to store any data.
  2. Service Account Token – Used to work with service accounts by generating a JWT token. Typically, the token is automatically created when a service account object is created.
  3. Basic Auth – Stores login and password in Base64-encoded format.
  4. SSH Auth – Used for SSH authentication. The secret contains a pre-generated private key.
  5. TLS Certificates – Involves using certificates and their private keys, provided in the manifest's tls.crt and tls.key fields (Base64-encoded). 
  6. Bootstrap Token – A special token type used to add new nodes to the Kubernetes cluster safely.

Secrets are usually injected into containers via volumeMount or secretKeyRef.

You can also use external secret management tools like HashiCorp Vault.

What Are Labels and Selectors, and What Are They Used For?

  • Labels are key-value metadata that can be attached to any Kubernetes object. They help to identify attributes of objects that are not directly related to the running services but can provide useful information to users — for example, the purpose of a deployed application or the environment in which it will run. In other words, labels are intended to distinguish between different instances of objects.
  • Selectors are used to filter or query objects based on their labels. A selector is a request to fetch objects that match specific label criteria.

What Are Probes in Kubernetes, What Types Exist, and What Are They Used For?

Probes in Kubernetes check the health and readiness of applications. There are three types:

  1. Liveness Probe: Checks whether a pod is running correctly. If the check fails, the pod is restarted automatically.
  2. Readiness Probe: Checks whether a pod is ready to receive network traffic. If it fails, the pod is excluded from load balancing, though it continues running.
  3. Startup Probe: Used for apps that take a long time to start. This probe checks the app's initial startup before liveness and readiness checks are activated.

What Is Pod Disruption Budget (PDB) and What Is It Used For?

Pod Disruption Budget is a Kubernetes feature used to ensure a minimum number of pods are available during voluntary disruptions (e.g., node maintenance or upgrades).

Example: If you have an application with 3 replicas that can tolerate the loss of 1 pod, then the PDB should specify that no more than 1 pod can be unavailable at any time. This prevents disruptions that would make the application non-functional.

How to Control Resource Usage in Containers?

Use requests and limits in your pod definitions:

  • Requests define the minimum amount of CPU and memory required for a pod to be scheduled. If the cluster doesn't have enough resources, the pod won't be scheduled.
  • Limits define the maximum amount of CPU and memory a pod can consume. The pod will be throttled or terminated if it exceeds these limits.

You can learn more about Kubernetes requests and limits in our article.

How to Expose an Application Running in Kubernetes to the External Network?

To provide external access to an application, you can use:

  • Ingress Controller – A preferred method for managing HTTP/HTTPS access. It routes traffic to services based on defined rules.
  • NodePort – Opens a specific port on all nodes for external access.
  • LoadBalancer – Provisions an external IP through a cloud load balancer.

What Is the CNI Interface?

CNI (Container Network Interface) is a Kubernetes specification maintained by the Cloud Native Computing Foundation. It defines how network interfaces are managed in Linux containers. CNI is responsible for connecting pods to the network.

CNI features are implemented through plugins, with popular ones including:

  • Calico
  • Weave
  • Flannel
  • Cilium

What Is CRI?

CRI (Container Runtime Interface) is the primary communication interface between the kubelet component in a Kubernetes cluster and the container runtime environment. Using CRI, Kubernetes interacts with the container engine responsible for creating and managing containers (Kubernetes itself does not create containers directly). 

Popular container runtimes that implement CRI include containerd and CRI-O.

What Is a Persistent Volume (PV)?

A Persistent Volume (PV) is a Kubernetes object used to store data persistently across pod lifecycles. Volumes in Kubernetes are implemented via plugins, and the platform supports the following types:

  • Container Storage Interface (CSI)
  • Fibre Channel (FC)
  • hostPath
  • iSCSI
  • Local Storage
  • Network File System (NFS)

What Is a Persistent Volume Claim (PVC)?

A Persistent Volume Claim (PVC) is a user request for storage resources. It allows users to claim a portion of a Persistent Volume based on parameters such as requested size and access mode. PVCs enable dynamic provisioning of storage in Kubernetes, meaning the cluster can automatically create a volume that matches the claim.

How to Assign Access Rights in a Kubernetes Cluster?

Kubernetes manages access control using RBAC (Role-Based Access Control). RBAC allows administrators to define who can do what within the cluster using the following entities:

  • Role – Defines a set of permissions within a specific namespace.
  • RoleBinding – Assigns a Role to a user or group within a namespace.
  • ClusterRole – Grants permissions across the entire cluster (not limited to a single namespace).
  • ClusterRoleBinding – Binds a ClusterRole to users or groups across all namespaces.
  • ServiceAccount – An identity used by Kubernetes workloads (pods) to interact with the API.

Conclusion

In this article, we covered a list of common interview questions that candidates might encounter when applying for IT roles involving Kubernetes. These questions span a range of foundational and advanced topics, including architecture, security, networking, and storage in Kubernetes.

Infrastructure

Similar

Infrastructure

VMware Cloud Director: What It Is and How to Use It

VMware Cloud Director (formerly vCloud Director, or “vCD”) is a modern solution for cloud providers, mainly designed for building virtual data centers on top of physical infrastructure. The platform allows combining all of a data center’s physical resources into virtual pools, which are then offered to end users on a rental basis. It integrates tightly with VMware’s own technologies: vCenter and vSphere. vCenter is a set of tools for managing virtual infrastructure, and vSphere is the virtualization platform for cloud computing. Key Capabilities of VMware Cloud Director Creation of virtual data centers (vDCs) with full isolation of virtual services and resources. Migration of virtual machines (VMs) between clouds, and self-deployment of OVF templates. Snapshots and rollback of VM changes. Creation of isolated and routable networks with external access. Integrated, tiered storage with load balancing between virtual machines. Network security: perimeter protection and firewalling. Encryption of access to cloud resources to secure the virtual infrastructure. Unified authentication across all VMware services (single sign-on) so users don’t need to re-authenticate. Deployment of multi‑tier applications as ready-made virtual appliances, with VMs and OS images. Allocation of isolated resources for different departments within a single virtual structure. How VMware Cloud Director Works VMware Cloud Director uses a multi-tenant model. Rather than building a dedicated environment for every customer, it creates a shared virtual environment. This reduces infrastructure maintenance costs massively: for large cloud providers, savings can reach hundreds of thousands or even millions of dollars per year, which in turn lowers the rental cost for end users. Resource consumption model: Using vCenter and vSphere, the provider aggregates physical resources into a shared pool called a “virtual data center” (vDC). From that pool, resources are allocated into Org vDCs (Organizational Virtual Data Centers), which are the fundamental compute units consumed by customers. VMware Cloud Director syncs with the vSphere database to request and allocate the required amount of resources. Org vDCs are containers of VMs and can be configured independently. Customers can order different numbers of Org vDCs for different purposes, e.g., one Org vDC for marketing, another for finance, a third for HR. At the same time, interconnectivity can be established between these Org vDCs, forming a large, virtual private data center. It’s also possible to combine Org vDCs into multiple networks. Additionally, within those networks, one can create vApps (virtual applications) made up of VMs, each with their own gateways to connect to Org vDCs. This setup allows building virtual networks of any architecture, isolated or routable, to match various business needs. When such a network is created, the provider assigns a user from the customer organization to the role of network administrator. A unique URL is also assigned to each organization. The administrator is responsible for adding or removing users, assigning roles and resources, creating network services, and more. They also manage connections to services provided by the cloud provider. For instance, VM templates or OVF/OVA modules, which simplify backup and VM migration. Resource Allocation Models in VMware Cloud Director VMware Cloud Director supports several models for allocating resources, depending on how you want to manage usage: Allocation Pool: You set resource limits and also define a guaranteed percentage of the shared pool for a user. This  model is good when you want predictable costs but don’t need full reservation. Pay-As-You-Go: No guaranteed resources, only consumption-based; ideal if usage is variable. The model is flexible and fits users who want to grow gradually. Reservation Pool: You reserve all available resources; user requests are limited only by what the provider’s data center can supply. Reservation Pool is suited for organizations that need fixed performance and large infrastructure. Useful Features of VMware Cloud Director Here are several powerful features that optimize resource usage, routing, and tenant isolation: Delegation of Privileges You can assign network administrators from the users of each organization. These admins get broad rights: they can create and manage VMs, deploy OVF/OVA templates, manage VM migration, set up isolated/routable networks, balance VM workloads, and more. Monitoring and Analytics Cloud Director includes a unified system for monitoring and analyzing VM infrastructure: VMs, storage, networks, memory. All data is logged and visualized in a dedicated dashboard, making it easier to detect and resolve problems proactively. Networking Features Networking in vCloud Director supports dynamic routing, distributed firewalls, hybrid cloud integration, and flexible traffic distribution. Many of these features are now standard in the newer versions of Cloud Director. If you don’t already have some of them, you may need to upgrade your NSX Edge and convert it to an Advanced Gateway in the UI. Dynamic routing improves reliability by eliminating manual route configuration. You can also define custom routing rules based on IP/MAC addresses or groups of servers. With NSX Edge load balancing, incoming traffic can be distributed evenly across pools of VMs selected by IP, improving scalability and performance. Access Control and More You can create custom user roles in the Cloud Director UI to control access tailored to organizational needs. VMs can be pinned to specific ESXi host groups (affinity rules), which helps with licensing or performance. If Distributed Resource Scheduler (DRS) is supported, Cloud Director can automatically balance VMs across hosts based on load. Additional useful features include automatic VM discovery and import, batch updating of server cluster cells, and network migration tools.
25 November 2025 · 5 min to read
Infrastructure

Why Developers Use the Cloud: Capabilities and Advantages

Today, up to 100% of startups begin operating based on providers offering services ranging from simple virtual hosting to dedicated servers. In this article, we will examine the advantages of cloud computing that have led to its dominance over the “classic” approach of having a dedicated server in a separate room. Cloud Use Cases Typical scenarios for using cloud technologies include: Full migration of a business application to a remote server. For example, enterprise resource planning or accounting software. These applications support operation via remote desktop interfaces, thin clients, or web browsers. Migration of specific business functions. Increasingly, archival copies are stored in the cloud while software continues running locally. Alternatively, a backup SQL server node can be hosted remotely and connected in case the local server fails. Implementation of new services. Businesses are increasingly adopting automated systems for data collection and analytics. For example, Business Intelligence (BI) technologies have become popular, helping generate current and comparative reports. Interaction between local and cloud environments. Hybrid services are well established in large networks. For example, a retail store may operate a local network with an on-site server, receive orders from an online store, and send requests back to transport companies, and so on.This setup allows offline operation even if the internet is fully disconnected: processing sales, receiving shipments, conducting inventories, with automatic synchronization once connectivity is restored. These examples represent foundational scenarios, giving developers plenty of room to innovate. This is one reason more and more coders are attracted to the cloud. Advantages Now let’s examine the advantages and disadvantages of cloud computing. Yes, the technology has some drawbacks, including dependency on internet bandwidth and somewhat higher requirements for IT specialists. Experienced professionals may need retraining, whereas younger personnel who learn cloud technologies from the start do not face such challenges. Speed Software development often requires significant time and effort for application testing. Applications must be verified across multiple platforms, resolutions, and device types. Maintaining local machines dedicated to testing is inefficient. Cloud computing solves this by enabling rapid deployment of virtually any environment, isolated from other projects, ensuring it does not interfere with team development. High deployment speed and access to cloud services also encourage IT startups to launch almost “from scratch,” with minimal resource investment. The advantages of cloud services are especially critical when development volumes periodically expand. Purchasing hardware consumes a developer’s most valuable resource: time. In the cloud, selecting a plan takes just a few minutes, and the setup of a remote host for specific tasks can begin immediately. Hardware resources on the remote server, such as CPU cores, memory, and storage, can also be easily adjusted. Security Building a private server is expensive. Besides the powerful machines, you will need backup power and internet lines, a separate room with air conditioning and fire protection, and security personnel to prevent unauthorized access. Cloud providers automatically provide all these features at any service level. Other security advantages include: Easier identity and access management (IAM). Higher reliability for continuous business operations. Protection against theft or seizure of storage devices containing sensitive data. On a cloud server, users cannot simply plug in a USB drive to download files. Data does not reside on local machines, and access is controlled according to company policy. Users only see what their role allows. This approach reduces the risk of viruses and accidental or intentional file deletion. Antivirus software runs on cloud platforms, and backups are automatically maintained. Cost Efficiency Purchasing server hardware is a major budget burden, even for large corporations. Before the cloud boom, this limited IT development. Modern developers often need test environments with unique infrastructure, which may only be required temporarily. Buying hardware for a one-time test is inefficient. Short-term rental of cloud infrastructure allows developers to complete tasks without worrying about hardware maintenance. Equipment costs directly impact project pricing and developer competitiveness, so cloud adoption is advantageous. Today, most software is developed for cloud infrastructure, at least with support for it. Maintenance, storage, and disposal costs for IT equipment also add up. Hardware becomes obsolete even if unused. This makes maintaining developer workstations for “simple” desktop software costly. Offloading this to a cloud provider allows developers to always work with the latest infrastructure. Convenience Another cloud advantage is ease of use. Cloud platforms simplify team collaboration and enable remote work. The platform is accessible from any device: desktop, laptop, tablet, or smartphone, allowing work from home, the office, or even a beach in Bali. Clouds have become a foundation for remote work, including project management. Other conveniences include: Easy client demonstrations: Developers can grant access and remotely show functionality, or run it on the client’s office computer without installing additional components. Quick deployment of standard solutions: Setting up an additional workstation takes only a few minutes, from registering a new user to their trial login. New developers can quickly join ongoing tasks. Easy role changes: In dynamic teams, personnel often switch between projects. Access to project folders can be revoked with a few clicks once a task is completed. This also applies to routine work: adding new employees, blocking access for former staff, or reassigning personnel. A single administrative console provides an overview of activity and simplifies version tracking, archiving, and rapid deployment during failures. Stability Another factor affecting developer success is the speed of task completion. Beyond rapid deployment, system stability is critical. On local machines, specialists depend on hardware reliability. A failure could delay project timelines due to hardware replacement and configuration. Moving software testing to the cloud enhances the stability of local IT resources, particularly in hybrid systems. Cloud data centers provide Tier 3 minimum reliability (99.982% uptime) without additional client investment. Resources are pre-provisioned and ready for use according to the chosen plan. Development, testing, and operation are typically conducted within a single provider’s platform, in an environment isolated from client services. Conclusion Cloud technologies offer numerous advantages with relatively few drawbacks. Businesses and individual users value these benefits, and developers are encouraged to follow trends and create new, in-demand products. Virtually all commerce has migrated to the cloud, and industrial sectors, especially those with extensive branch networks and remote facilities, are also adopting cloud solutions.
25 November 2025 · 6 min to read
Infrastructure

PostgreSQL vs MySQL: Which Database Is Right for Your Business?

PostgreSQL and MySQL are among the most popular relational databases. In this article, we will examine the functional differences between them and compare their performance so that you can choose the database that is suitable for your business. PostgreSQL vs MySQL Despite the increasing similarity in features between PostgreSQL and MySQL, important differences remain. For example, PostgreSQL is better suited for managing large and complex databases, while MySQL is optimal for website and online-application databases because it is oriented toward speed. This follows from the internal structure of these relational database systems, which we will examine. Data Storage in PostgreSQL and MySQL Like any other relational databases, these systems store data in tables. However, MySQL uses several storage engines for this, while PostgreSQL uses only a single storage engine. On one hand, this makes PostgreSQL more convenient, because MySQL’s engines read and write data to disk differently. On the other hand, MySQL offers greater flexibility in choosing a data engine. However, PostgreSQL has an advantage: its storage engine implements table inheritance, where tables are represented as objects. As a result, operations are performed using object-oriented functions. Support The SQL standard is over 35 years old, and only the developers of PostgreSQL aim to bring their product into full compliance with the standard. The developers of MySQL use a different approach: if a certain feature simplifies working with the system, it will be implemented even if it does not fully conform to the standard. This makes MySQL more user-friendly compared to PostgreSQL. In terms of community support, the number of MySQL developers still exceeds those working with PostgreSQL, but you can receive qualified help in both communities. In addition, many free guides and even books have been written about PostgreSQL, containing answers to most questions. It is also worth noting that both platforms are free, but MySQL has several commercial editions, which can sometimes lead to additional expenses. Programming Languages Both systems support a wide range of programming languages. Among the popular ones are C++, Java, Python, lua, and PHP. Therefore, a company’s development team will not face difficulties implementing features in either system. Operating Systems MySQL is a more universal system that runs on Windows, Linux, macOS, and several other operating systems. PostgreSQL was originally designed for Linux, but with the REST API interface, it becomes an equally universal solution that operates on any OS. Data Processing PostgreSQL provides more capabilities for data processing. For example, a cursor is used for moving through table data, and responses are written to the memory of the database server rather than the client, as in MySQL. PostgreSQL also allows building indexes simultaneously for several columns. It supports different index types, allowing work with multiple data types. This database also supports regular expressions in queries. However, new fields in PostgreSQL can only be added at the end of a table. Parallel data processing is better organized in PostgreSQL because the platform has a built-in implementation of MVCC (multiversion concurrency control). MVCC can also be supported in MySQL, but only if InnoDB is used. Concerning replication, PostgreSQL supports logical, streaming, and bidirectional replication, while MySQL supports circular replication as well as master-master and master-standby. Replication refers to copying data between databases located on different servers. PostgreSQL and MySQL: Performance Comparison Testing is fair only when comparing two clean, “out-of-the-box” systems. Indexed testing provides the following results: Insertion: PostgreSQL is more than 2.7× faster, processing a 400,000-record database in 5.5 seconds versus 15 seconds for MySQL. Inner join: PostgreSQL processes 400,000 records in 1.1 seconds, MySQL in 2.8 seconds: a gain of more than 2.5×. Indexed sorting: PostgreSQL processes the same number of records in 0.9 seconds, MySQL in 1.5 seconds. Grouping: For the same 400,000-record database, PostgreSQL achieves 0.35 seconds, MySQL 0.52 seconds. Indexed selection: PostgreSQL is 2× faster: 0.6 seconds vs. 1.2 seconds. When it comes to updating data, PostgreSQL’s update time increases gradually as the number of records grows, while MySQL processes them in roughly the same time, starting from 100,000 records. This is due to different data-storage implementations. Nevertheless, PostgreSQL holds a significant advantage over MySQL even with large data volumes: 3.5 seconds versus 9.5 seconds for 400,000 records—more than 2.7× faster. Without indexes, PostgreSQL also shows surprisingly high performance, processing a 400,000-record database in 1.3, 0.7, and 2.2 seconds for inner join, selection, and update operations, respectively. Thus, PostgreSQL delivers an average performance advantage of about 2× (2.06). Although MySQL was originally positioned as a high-performance platform, constant optimization by the PostgreSQL development team has resulted in greater efficiency. Advantages for Developers Here we consider only the unique features characteristic of each platform. Therefore, we will not discuss support for MVCC or ACID, as these features are present in both systems. From a developer’s perspective, MySQL is advantageous because it: Provides increased flexibility and is easily scalable, with more than ten storage engines based on different data-storage algorithms. Handles small read-oriented databases more efficiently (i.e., without frequent writes). Is easier to manage and maintain, because it requires less configuration and fewer preparatory steps before starting work. From a developer’s perspective, PostgreSQL is advantageous because it: Offers an object-oriented approach to data, enabling inheritance and allowing the creation of more complex table structures that do not fit the traditional relational model. Handles write-oriented databases better, including validation of written data. Supports object-oriented programming features, enabling work with NoSQL-style data, including XML and JSON formats. Can support databases without limitations on data volume. Some companies use PostgreSQL to run databases as large as several petabytes. PostgreSQL and MySQL Comparison For clarity, the main features of both systems can be presented in a table:   PostgreSQL MySQL Supported OS Solaris, Windows, Linux, OS X, Unix, HP-UX Solaris, Windows, Linux, OS X, FreeBSD Use cases Large databases with complex queries (e.g., Big Data) Lighter databases (e.g., websites and applications) Data types Supports advanced data types, including arrays and hstore Supports standard SQL data types Table inheritance Yes No Triggers Supports triggers for a wide range of commands Limited trigger support Storage engines Single (Storage Engine) Multiple As we can see, several features are implemented only in PostgreSQL. Both systems support ODBC, JDBC, CTE (common table expressions), declarative partitioning, GIS, SRS, window functions, and many other features. Conclusion Each system has its strengths. MySQL handles horizontal scaling well and is easier to configure and manage. However, if you expect database expansion or plan to work with different data types, it is better to consider implementing PostgreSQL in advance. Moreover, PostgreSQL is a fully free solution, so companies with limited budgets can use it without fear of unnecessary costs.
24 November 2025 · 6 min to read

Do you have questions,
comments, or concerns?

Our professionals are available to assist you at any moment,
whether you need help or are just unsure of where to start.
Email us
Hostman's Support