Skip to content
The Computer Science
TheCScience
  • Engineering Subjects
    • Human Values
    • Computer System Architecture
    • Digital Communication
    • Internet of Things
  • NCERT Solutions
    • Class 12
    • Class 11
  • HackerRank solutions
    • HackerRank Algorithms Problems Solutions
    • HackerRank C solutions
    • HackerRank C++ problems solutions
    • HackerRank Java problems solutions
    • HackerRank Python problems solutions
The Computer Science
TheCScience

Parallel Processing in Computer Architecture

YASH PAL, March 6, 2026March 6, 2026

Parallel Processing in Computer Architecture – The computational speed of conventional computers is slow. This speed can be increased by performing simultaneous data processing. This is known as parallel processing. This technique is able to perform concurrent data processing to achieve faster execution.

Pipeline processing is a common technique that is used to achieve parallel processing. Pipelining is a technique of decomposing a sequential process into sub-operations. Each sub-operation is executed concurrently in a special dedicated segment with all other segments. The result obtained from the computation in each segment is transferred to the next segment in the pipeline. The final result is obtained after the data have passed through all segments.

Pipelining also consists of vector processing, scalar processing, etc. There are so many computational problems that can not be solved by conventional computers. These problems can be formulated in terms of vectors and matrices. This capability of a computer to solve the computational problem is known as vector processing capability. Computers with vector processing capabilities are required in specialized applications.

Parallel Processing

The term parallel processing indicates that the system can perform several operations simultaneously. Now we will elaborate on the scenario. In a CPU, we will have only one Accumulator, which will store the results of the current operation.

Now, if we are giving only one command, such as “a+b”, then the CPU performs the operation and stores the result in the accumulator. Now we are talking about parallel processing, therefore we will be issuing two instructions, “a+b” and “c-d”, at the same time. Now, if the result of the “a+b” operation is stored in the accumulator, then the “c-d” result cannot be stored in the accumulator at the same time.

Therefore, the term parallel processing is not only based on the Arithmetic, logic, or shift operations. The above problem can be solved in the following manner. Consider the registers RI and R2, which will be storing the operands before the operation, and R3 is the register that will be storing the results after the operation. Now, the above two instructions, “a+b” and “c-d”, will be done in parallel as follows.

  • Values of “a” and “b” are fetched into the registers R1 and R2.
  • The values of R1 and R2 will be sent into the ALU unit to perform the addition.
  • The result will be stored in the Accumulator.
  • When the ALU unit is performing the calculation, the next data “c” and “d” are brought into R1 and R2.
  • Finally, the value of the Accumulator obtained from “a-b” will be transferred into R3. Next, the values of C and D from R1 and R2 will be brought into the ALU to perform the “c-d” operation.
  • Since the accumulator value of the previous operation is present in R3, the result of “c-d” can be safely stored in the Accumulator.

This is the process of parallel processing of only one CPU. Consider several such CPUs performing the calculations separately. This is the concept of parallel processing.

Parallel processing in computer architecture
Parallel Processing

In the above figure One possible way of separating the execution unit into eight functional units operating in parallel. The operands in the registers are applied to one of the units depending on the operation specified by the instruction associated with the operands. The operation performed in each functional unit is indicated in each block of the diagram.

We can see that the data stored in the processor registers is being sent to separate devices based on the operation needed on the data. If the data inside the processor registers is requesting an arithmetic operation, then the data will be sent to the arithmetic unit, and if at the same time another data is requested in the logic unit, then the data will be sent to the logic unit for logical operations. Now, at the same time, both arithmetic operations and logical operations are executing in parallel. This is called parallel processing.

We consider parallel processing under the following main topics:

  1. Pipeline processing: Pipeline processing is an implementation technique where arithmetic suboperations or the phase of a computer instruction cycle overlap in execution.
  2. Vector processing: Vector processing deals with computations involving large vectors and matrices.
  3. Array processing: An array processor performs computations on large arrays of data.
Computer System Architecture engineering subjects Computer System Architecture

Post navigation

Previous post
Next post

Computer Architecture fundamentals
Basic structure of a computer
Functional Units of Computer
Development of Computers
Von Neuman and Harvard machine Architecture
Flynn Classification
Computer Structure Architecture
Interfacing Logic Devices
Levels of Design abstraction
Performance Metrics

Register Transfer Language
Memory Transfer
Arithmetic Micro-operations
Arithmetic Complements
Logic Micro-operations
Shift Micro-operations
Bus Architecture
Data Transfer
Central Processing Unit
CPU Bus Architecture

Computer Register and Types
Common Bus System
Instruction Format
Instruction Types
Instruction Cycle
Addressing Modes
Design of a basic computer

Basic function of a Computer
General register organization
Stack organization
Infix to Reverse Polish Notation Conversion
Instruction Types and their classifications
Data transfer and manipulation
Program control
RISC characteristics
CISC characteristics

Pipeline
Types of Pipeline
Arithmetic Pipeline
Instruction Pipeline
Hazards
Vector Processing

Data Representation
Addition and Subtraction
Adder Circuits
Shift and Add Multiplication Method
Booth's Algorithm
Restoring Division Algorithm
Non-Restoring Division Algorithm
Array Multiplier

Memory Classification
Memory Characteristics
Memory Organization
Memory Types
Associative Memory
Cache Memory
Virtual Memory

Input Output Interface
Modes of Data Transfer
Priority Interrupt
Direct Memory Access
Input-Output Processor
Serial Communication

TheCScience

We at TheCScience.com are working towards the goal to give free education to every person by publishing in dept article about Secondary, Senior-Secondary, and Graduation level subjects.

Pages

About US

Contact US

Privacy Policy

DMCA

Our Tools

Hosting - get 20% off

Engineering Subjects

Internet of Things

Human Values

Digital Communication

Computer System Architecture

Programming Tutorials

Data Structure and Algorithm

C

Java

NCERT

Class 12th

©2026 TheCScience | WordPress Theme by SuperbThemes