Pick-N-Place

Introduction to Robotics

2025

Introduction to Robotics taught me robotics theory into real systems that integrate perception, kinematics, and control, aligned with my goal of building human-centered robots. Through this project, I gained hands-on experience with vision-based manipulation, coordinate transformations, inverse kinematics, and ROS-based robot control.

Key Contributions:

Autonomous Pick-and-Place Pipeline: Developed an end-to-end vision-to-manipulation system enabling a UR3 robotic arm to autonomously pick and stack blocks using ROS
Kinematics-Driven Motion Planning: Derived and implemented forward and inverse kinematics for a 6-DoF UR3 arm to generate reliable joint-space trajectories from arbitrary block configurations
Vision-Based Block Localization: Implemented OpenCV-based color segmentation and blob detection, converting image-frame pixel coordinates into world-frame positions for precise grasping.

This project taught me how perception errors, kinematic modeling, and control decisions are tightly coupled in robotic manipulation. I learned the importance of accurate coordinate transformations and inverse kinematics in achieving reliable real-world pick-and-place performance.

Objective

The objective of this project was to design an autonomous pick-and-place system for a UR3 robotic arm by integrating vision-based block detection, coordinate transformation, inverse kinematics, and ROS-based control. The system detects colored blocks, computes feasible joint trajectories, and executes reliable grasping and placement in a structured workspace.

Method

Forward Kinematics

Forward kinematics was used to compute the end-effector pose from joint angles, enabling verification of inverse kinematics solutions and ensuring accurate execution of pick-and-place motions. In this project, the screw axes (w_i, v_i) define each UR3 joint’s rotation axis and linear component in the space frame, enabling forward kinematics to be computed as implemented in the lab kinematics functions.

T(\theta)=e^{[S_1]\theta_1}e^{[S_2]\theta_2}e^{[S_3]\theta_3}e^{[S_4]\theta_4}e^{[S_5]\theta_5}e^{[S_6]\theta_6}M

Inverse Kinematics

Inverse kinematics was used whenever the robot needed to move the end-effector to a target block location or placement position defined in world coordinates. After detecting block positions from the camera and transforming them into the world frame, the project computed an elbow-up inverse kinematics solution for the UR3 arm based on its mechanical schematic.

Using the known lengths and offsets of each UR3 link, the wrist center was first calculated by compensating for the gripper geometry and desired end-effector yaw. The shoulder and elbow joint angles were then solved geometrically using trigonometric relationships derived from the robot’s link lengths, and the remaining wrist joints were computed to satisfy the target orientation, enabling accurate and repeatable pick-and-place motions.

Block (Blob) Perception

Block perception was implemented using a camera-based vision pipeline in Python to detect colored blocks and localize them for manipulation. Camera images were processed using OpenCV and NumPy, where images were converted to the HSV color space and thresholded to segment target block colors. Blob detection was then applied using OpenCV’s SimpleBlobDetector to extract the centroids of valid block regions while filtering out noise and small artifacts.



raw_image = self.bridge.imgmsg_to_cv2(data, "bgr8")
cv_image = cv2.flip(raw_image, -1)

xw_yw_G = blob_search(cv_image, "green")
xw_yw_Y = blob_search(cv_image, "yellow")

def blob_search(image_raw, color):

    # Setup SimpleBlobDetector parameters.
    params = cv2.SimpleBlobDetector_Params()

    # Filter by Area.
    params.filterByArea = True
    params.minArea = 50

    # Create a detector with the parameters
    detector = cv2.SimpleBlobDetector_create(params)

    # Convert the image into the HSV color space
    hsv_image = cv2.cvtColor(image_raw, cv2.COLOR_BGR2HSV)

    if color == "yellow":
        lower = (20, 100, 100)
        upper = (30, 255, 255)

    if color == "green":
        lower = (45, 60, 55)
        upper = (70, 255, 255)

    mask_image = cv2.inRange(hsv_image, lower, upper)

    keypoints = detector.detect(mask_image)

    # Find blob centers in the image coordinates
    blob_image_center = []
    num_blobs = len(keypoints)
    for i in range(num_blobs):
        blob_image_center.append((keypoints[i].pt[0], keypoints[i].pt[1]))

The detected block centroids in pixel coordinates were subsequently transformed into world-frame coordinates using a calibrated image-to-world mapping. This geometric transformation enabled the vision system to interface directly with the robot’s inverse kinematics solver, providing accurate target positions for autonomous pick-and-place execution.

# Params for camera calibration
theta = 0.026660348374597458
beta = 0.001332267944962386
tx = 271 * beta + 0.01  # m
ty = 44 * beta          # m

# Function that converts image coord to world coord
def IMG2W(col, row):
    Or = row
    Oc = col

    xc = beta * Or
    yc = beta * Oc

    Rcw = np.array([[np.cos(theta), -np.sin(theta), 0],
                    [np.sin(theta),  np.cos(theta), 0],
                    [0,              0,             1]])

    xc = xc - ty
    yc = yc - tx

    pw = Rcw @ np.array([xc, yc, 0])
    return [pw[0], pw[1]]

ROS Software Development

The ROS software architecture was designed to decouple perception, kinematics, and control to support real-time operation and robust data flow.

Used ROS image callbacks to process camera data independently from arm motion, enabling smooth and reliable execution.
Ensured continuous data flow from perception through kinematics to motion execution without blocking or race conditions

Takeaways

Motion and Trajectory Planning Are Critical for Safety

During testing, there were moments when a poorly chosen joint trajectory caused the robot arm to swing uncomfortably close to surrounding lab equipment, including the monitor. That experience made it clear to me that solving kinematics alone is not enough, but how the robot moves between poses is just as important as reaching the target safely.

Manipulation Must Account for Perception Constraints

I also realized that certain arm configurations unintentionally blocked the overhead camera’s view of the workspace, which disrupted block detection. This taught me that effective manipulation requires coordinating motion planning with perception constraints, and it motivated me to learn more about efficient, collision-aware trajectory planning to make the system both faster and more reliable.