Tumor intracellular delivery is a vital focus in cancer therapy, aimed at effectively transporting therapeutic agents directly into tumor cells to enhance treatment efficacy. Advanced techniques, such as NP-based systems and liposomes, are designed to encapsulate drugs or genetic material and facilitate their uptake by cancer cells [
175]. Additionally, researchers are exploring innovative approaches such as stimuli-responsive systems that release their payload in response to the unique microenvironment of tumors, including pH changes [
176] or enzyme response [
177] By improving intracellular delivery into tumor cells, these strategies aim to potentiate the therapeutic impact and overcome resistance mechanisms, paving the way for more effective cancer treatments. By making some special designs on the morphology of microrobots, it can be facilitated to enter the cell. Yan et al
. [
178] developed biomimetic nanorobots with a head and hollow tail using a site-selective superassembly strategy. Their self-propulsion capability is induced by temperature gradient, and spiky heads significantly enhance cellular uptake at interfaces, facilitate transvascular extravasation, and improve intratumoral penetration. As Fig.
6E shows, spike-robots with NIR irradiation have better cell entrance ability. Upon entering the cell, microrobotic swarms face the risk of degradation due to cellular internalization and subsequent sequestration within lysosomes, where hydrolytic enzymes and acidic conditions prevail [
179–
181]. Therefore, the release from lysosomes is critical for transporting microrobots into the cytoplasm and for their effective intracellular function. Various strategies have been developed to promote the escape of nanomaterials from lysosomes. Certain positively charged substances function as drug carriers that enter lysosomes, destabilizing the negatively charged lysosomal membrane through electrostatic interactions, which facilitates lysosomal escape [
182]. However, positively charged materials are rapidly cleared from circulation, limiting their ability to reach tumors effectively [
183]. Alternatively, extracellular membrane vesicles that express fusion peptides can achieve membrane fusion due to conformational changes induced by the acidic environment of lysosomes [
184]. Nonetheless, all these strategies primarily depend on passive diffusion to target tumor sites. Peng et al
. [
68] introduced autonomous nanorobots for active drug delivery targeting mitochondria (Fig.
6F). These microrobots are created by simply encapsulating mitochondriotropic doxorubicin-triphenylphosphonium (DOX-TPP) within zeolitic imidazolate framework-67 (ZIF-67) NPs. ZIF-67 can decompose bioavailable H
2O
2 to lead to lysosome escape. As Fig.
6F shows, ZIF-67@DOX-TPP nanorobots exhibited effective motion tracking with colored trajectories inside the T24 bladder cancer cell compared to normal cells (SV-HUC-1). From the fluorescent imaging, the Pearson's correlation coefficient value of the ZIF-67@DOX-TPP group (0.97) was higher than the DOX group (0.11), which means that the ZIF-67@DOX-TPP group possesses better lysosomal escape.