Behind the Scenes of Our Self-Driving RC Car Project
- Mallik Inc.
- Feb 13

1. Project Motivation & Scope
This project demonstrates small-scale autonomous driving using a standard RC car chassis, a Raspberry Pi, and a Convolutional Neural Network (CNN). While the platform is an affordable hobby-grade model, the core methods—data collection, sensor processing, and neural network inference—mirror techniques used in full-scale self-driving systems.
Key objectives:
Automate Steering & Throttle based on real-time camera input.
Employ Python & CNN-based computer vision for lane detection or track following.
Optimize for Edge Inference on a Raspberry Pi with limited CPU/GPU resources.
2. Hardware & Electronics Breakdown
RC Car Chassis
A standard hobby RC car (~1/10 or 1/12 scale).
Includes a servo for steering and an electronic speed controller (ESC) for throttle.
Modified to accept control signals from the Pi via GPIO pins or a dedicated motor driver board (e.g., L298N).
Raspberry Pi (Model 3 or 4)
Runs a lightweight Linux distribution (Raspberry Pi OS).
Handles image capture via the Pi Camera or a USB webcam.
Executes CNN inference in Python, typically using frameworks like TensorFlow Lite or PyTorch (with CPU-only support unless using specialized Pi-compatible accelerators).
Camera Module
Either the official Pi Camera (CSI interface) or a standard USB webcam.
Mounted at the car’s front bumper or slightly above it to mimic a driver’s perspective.
Often set to capture frames at 20–30 FPS for smoother real-time inference.
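As a rough illustration, the snippet below opens a camera with OpenCV and requests a capture in that frame-rate range. The device index and resolution are assumptions for this sketch; the Pi Camera can alternatively be accessed through the picamera2 library.

```python
import cv2

# Open the first available camera (USB webcam, or the Pi Camera via its V4L2 driver).
# The device index (0) and 640x480 resolution are assumptions for this sketch.
cap = cv2.VideoCapture(0)
cap.set(cv2.CAP_PROP_FRAME_WIDTH, 640)
cap.set(cv2.CAP_PROP_FRAME_HEIGHT, 480)
cap.set(cv2.CAP_PROP_FPS, 30)  # the driver may silently clamp this value

ret, frame = cap.read()
if not ret:
    raise RuntimeError("Failed to grab a frame from the camera")
print("Captured frame with shape:", frame.shape)
cap.release()
```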
Power Supply
Separate LiPo batteries or power banks: one for the RC motors/ESC, another for the Pi if needed.
Ensures the Pi’s voltage remains stable under dynamic load conditions from the motors.
Optional Extras
Additional Sensors: Ultrasonic or LiDAR for depth measurement or collision avoidance.
PWM Driver Board: If you need more precise control signals (e.g., PCA9685 for hardware PWM).
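To make the wiring concrete, here is a minimal sketch that drives the steering servo directly from a GPIO pin using software PWM via RPi.GPIO. The pin number and duty-cycle values are assumptions and must be calibrated for your servo; a PCA9685 board would generate steadier hardware pulses, but the idea is the same.

```python
import time
import RPi.GPIO as GPIO

SERVO_PIN = 18     # assumed GPIO pin wired to the steering servo's signal line
PWM_FREQ_HZ = 50   # standard hobby-servo frame rate (20 ms period)

GPIO.setmode(GPIO.BCM)
GPIO.setup(SERVO_PIN, GPIO.OUT)

pwm = GPIO.PWM(SERVO_PIN, PWM_FREQ_HZ)
pwm.start(7.5)     # ~1.5 ms pulse: roughly the neutral (straight-ahead) position

try:
    pwm.ChangeDutyCycle(5.0)   # ~1.0 ms pulse: steer toward one end stop
    time.sleep(1)
    pwm.ChangeDutyCycle(10.0)  # ~2.0 ms pulse: steer toward the other end stop
    time.sleep(1)
finally:
    pwm.stop()
    GPIO.cleanup()
```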
3. Data Collection & Preparation
3.1 Manual Driving Sessions
Remote Control Driving:
Drive the RC car manually around a track or indoor course.
The Pi records video frames at a chosen frame rate (e.g., 10–20 FPS).
Logging Steering & Throttle:
Use a Python script to capture real-time servo/ESC commands from the RC receiver or direct user input.
Each camera frame is timestamped and paired with the corresponding steering angle (and throttle, if desired) in a CSV or JSON file; a logging sketch appears after this list.
Data Transfer & Storage:
Store images locally on the Pi’s microSD card or an attached USB drive.
Optionally transfer data via SSH/scp to a more powerful machine for training.
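A minimal logging loop might look like the sketch below: each saved frame is paired with the current steering and throttle commands in a CSV row. The read_rc_commands() helper is hypothetical (it would wrap your receiver or manual-control input), and the paths and frame rate are assumptions.

```python
import csv
import os
import time
import cv2

def read_rc_commands():
    """Hypothetical helper: return (steering, throttle) read from the RC
    receiver or the manual-control interface. Replace with your own code."""
    return 0.0, 0.0

os.makedirs("frames", exist_ok=True)
cap = cv2.VideoCapture(0)
FRAME_PERIOD = 1.0 / 15  # ~15 FPS logging rate (assumption)

with open("drive_log.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["timestamp", "image_path", "steering", "throttle"])
    while True:  # stop the session with Ctrl+C
        ok, frame = cap.read()
        if not ok:
            break
        t = time.time()
        steering, throttle = read_rc_commands()
        image_path = f"frames/{t:.3f}.jpg"
        cv2.imwrite(image_path, frame)
        writer.writerow([t, image_path, steering, throttle])
        time.sleep(FRAME_PERIOD)
```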
3.2 Data Preprocessing
Image Resizing: Common input sizes are around 120×160 or 66×200 pixels (height×width) to reduce computation.
Cropping: Remove sky or car hood from the image to focus on the road or track.
Normalization: Scale pixel values (e.g., 0–1 or -1 to 1) to stabilize CNN training.
Augmentation (Optional): Random brightness shifts or slight rotations can improve generalization. Horizontal flips are only valid if the steering label is negated to match, since left/right steering is directional.
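Those steps might translate to something like the following sketch; the crop boundary and target size are assumptions that depend on how the camera is mounted.

```python
import cv2
import numpy as np

def preprocess(frame: np.ndarray) -> np.ndarray:
    """Crop, resize, and normalize a BGR camera frame for the CNN."""
    cropped = frame[60:, :, :]                # drop the top rows (sky); assumed crop
    resized = cv2.resize(cropped, (200, 66))  # cv2 expects (width, height) -> 66x200 input
    return resized.astype(np.float32) / 127.5 - 1.0  # scale pixels to [-1, 1]
```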
4. CNN Architecture & Training
4.1 Network Design
A Convolutional Neural Network is employed for end-to-end steering angle prediction. A typical architecture (sketched in code at the end of this subsection) might include:
Input Layer: height×width×channels, e.g., 66×200×3.
Conv-Pool Stacks:
Convolutional layers (3×3 filters) extracting low-level features.
ReLU activation for non-linearity.
Max Pooling to reduce spatial dimensions and control overfitting.
Flatten & Fully Connected Layers:
Transforms the final convolution output into a 1D vector.
Several dense layers with ReLU activation lead to the final output neuron.
Output Layer: Single neuron for steering angle (float value). For throttle, you might have a second neuron or a multi-output model.
Loss Function
Mean Squared Error (MSE) is common for regression-based steering predictions.
Optimizer & Training Settings
Adam optimizer (learning rate ~1e-3) is often chosen for its adaptive learning properties.
Batch sizes typically range from 16 to 64, depending on available system memory.
Training can run for 10–50 epochs or until validation loss stabilizes.
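Put together, a Keras version of such a network might look like the sketch below. The filter counts and dense-layer sizes are assumptions rather than a prescription, and the compile step mirrors the loss and optimizer choices above.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_model(input_shape=(66, 200, 3)):
    model = models.Sequential([
        layers.Input(shape=input_shape),
        layers.Conv2D(24, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Conv2D(36, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Conv2D(48, (3, 3), activation="relu"),
        layers.MaxPooling2D((2, 2)),
        layers.Flatten(),
        layers.Dense(100, activation="relu"),
        layers.Dense(50, activation="relu"),
        layers.Dense(1),  # single output neuron: predicted steering angle
    ])
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3),
        loss="mse",  # mean squared error for steering regression
    )
    return model
```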
4.2 Training Workflow
Environment: Training can occur on a desktop/laptop GPU if the Pi is underpowered. Once trained, the model is converted or saved for inference on the Pi.
Validation & Testing:
Split data into train/validation sets (e.g., 80/20).
Monitor validation loss to detect overfitting.
Model Conversion:
For on-device inference, consider exporting to TensorFlow Lite or a lightweight PyTorch model.
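As a rough example of that conversion step, a trained Keras model can be exported to TensorFlow Lite as shown below; enabling the default optimizations applies the kind of quantization discussed in the next section. The file names are placeholders.

```python
import tensorflow as tf

# Load the trained Keras model (the path is a placeholder).
model = tf.keras.models.load_model("steering_model.h5")

converter = tf.lite.TFLiteConverter.from_keras_model(model)
# Optional: default optimizations include weight quantization, which typically
# shrinks the model and speeds up CPU inference on the Pi.
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()

with open("steering_model.tflite", "wb") as f:
    f.write(tflite_model)
```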
5. Real-Time Inference on the Pi
5.1 Deployment & Integration
Inference Script
A Python script captures each camera frame.
The CNN processes the frame to output a steering angle (and possibly throttle).
Commands are sent via GPIO/PWM to control the servo and ESC in near real-time.
Latency Considerations
On a Pi 3 or 4 CPU, typical inference times for a small CNN range from roughly 30 to 100 ms.
Minimizing model size and employing integer quantization can help reduce overhead.
Control Loop
A loop runs at a fixed frequency (e.g., 10–20 Hz), capturing a frame, running inference, and updating servo/throttle signals.
Additional heuristics or filtering (e.g., exponential smoothing) can stabilize rapid angle changes.
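A simplified control loop with exponential smoothing could look like the sketch below. It re-uses the preprocessing from Section 3.2, loads the converted TensorFlow Lite model, and calls a hypothetical set_steering() helper that would translate the predicted angle into a servo PWM command; the loop rate and smoothing factor are assumptions.

```python
import time
import cv2
import numpy as np
from tflite_runtime.interpreter import Interpreter  # or tf.lite.Interpreter

def preprocess(frame):
    """Crop, resize, and normalize (same steps as in Section 3.2)."""
    resized = cv2.resize(frame[60:, :, :], (200, 66))
    return resized.astype(np.float32) / 127.5 - 1.0

def set_steering(angle):
    """Hypothetical helper: convert a predicted angle into a servo PWM command."""
    pass

interpreter = Interpreter(model_path="steering_model.tflite")
interpreter.allocate_tensors()
input_idx = interpreter.get_input_details()[0]["index"]
output_idx = interpreter.get_output_details()[0]["index"]

cap = cv2.VideoCapture(0)
LOOP_PERIOD = 1.0 / 15   # target ~15 Hz control rate (assumption)
ALPHA = 0.3              # exponential smoothing factor (assumption)
smoothed_angle = 0.0

while True:
    start = time.time()
    ok, frame = cap.read()
    if not ok:
        continue
    x = preprocess(frame)[np.newaxis, ...]   # add the batch dimension
    interpreter.set_tensor(input_idx, x)
    interpreter.invoke()
    angle = float(interpreter.get_tensor(output_idx)[0][0])
    # Exponential smoothing damps rapid steering changes between frames.
    smoothed_angle = ALPHA * angle + (1 - ALPHA) * smoothed_angle
    set_steering(smoothed_angle)
    # Sleep off whatever time remains in this control period.
    time.sleep(max(0.0, LOOP_PERIOD - (time.time() - start)))
```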
6. Key Observations & Technical Challenges
Limited Computing Resources
The Raspberry Pi’s CPU/GPU may constrain CNN complexity. Pruning or compressing models is often necessary for real-time performance.
Data Diversity
Model generalization heavily depends on varied training data (lighting conditions, track layouts). Overfitting to a single environment can lead to poor performance elsewhere.
Servo/ESC Calibration
Hardware offsets, neutral points, and maximum steering angles vary between RC car models. Fine-tuning these is crucial for consistent control.
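As an illustration, the snippet below maps a normalized steering command in [-1, 1] to a servo pulse width using per-car calibration constants; the microsecond values are assumptions that must be tuned for each chassis.

```python
# Assumed per-car calibration constants (tune these for your servo and chassis).
NEUTRAL_US = 1500   # pulse width for straight-ahead steering, in microseconds
RANGE_US = 400      # maximum deviation from neutral before the end stops
TRIM_US = -20       # small offset correcting a mechanically off-center servo

def steering_to_pulse_us(command: float) -> float:
    """Map a normalized steering command in [-1, 1] to a servo pulse width."""
    command = max(-1.0, min(1.0, command))  # clamp to the valid range
    return NEUTRAL_US + TRIM_US + command * RANGE_US

# Example: a half-strength command toward one side (sign convention is an assumption).
print(steering_to_pulse_us(-0.5))  # -> 1280.0 with the constants above
```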
Safety & Testing Environment
Even at small scale, unexpected inference results can lead to collisions. Testing in open spaces or enclosed tracks reduces risk.
7. Future Plans
Better Models & Hardware
Evaluate more powerful single-board computers (e.g., NVIDIA Jetson Nano) or neural accelerators for larger CNN architectures.
Investigate advanced model designs (e.g., MobileNet or attention-based networks) to improve speed and accuracy.
Scaling to a Full-Sized Vehicle
In the long term, adapt the system to an older automatic car, adding external sensors (LiDAR, radar) and robust fail-safe mechanisms.
This would involve integrating drive-by-wire components, advanced safety checks, and compliance with local regulations.
Additional Sensing
Incorporate ultrasonic or LiDAR sensors for depth perception and obstacle avoidance.
Improve reliability under varied conditions (night driving, uneven terrain).
Conclusion
This RC car project provides a technical foundation in real-time computer vision, deep learning, and embedded systems engineering. By combining a Raspberry Pi, carefully curated training data, and an optimized CNN, it’s possible to achieve autonomous driving at a small scale. Much of this approach can be extended to larger robotics projects, warehouse automation, or eventually converting a full-sized vehicle for more advanced self-driving experiments.
For a detailed, step-by-step tutorial, be sure to check out the full Medium series.
If you’d like to explore how these methods could be applied to enterprise-scale AI solutions or customized robotics, feel free to reach out to us at KaizenDev. We’re always eager to collaborate and push the boundaries of technology.