Crop sensor can crop with a greater pixel density tho correct?
Yes, as compared to cropping later in an editor, the DX image has many more pixels.
The lens projects the same image regardless of what sensor is present. If you can remember back to film days, it is exactly like a movie projector or a slide projector, projecting an image. Regardless of what size projection screen we might place at say the same 10 feet, the projector projects the same image on it. A smaller screen would be like a cropped sensor, seeing only a smaller view of that larger projected image. We could call that a "cropped screen".
But it is exactly the same concept as the cropped sensor. The sensor is merely a projection screen for the projected lens image.
Let's say that our FX camera lens projects a circular image about 44mm diameter. That projected image falls on whatever sensor is present.
The 36x24mm full frame sensor has a diagonal of 43.27 mm. It captures a photo image that size.
The 24x16mm DX sensor has a diagonal 28.8 mm. The FX image is 43.27 / 28.8 = 1.5x larger. The DX sensor simply crops the view of the full lens image. The lens image is not affected in any way by the sensor that is present (regardless if the focus distance is 1:1 macro, or 30 feet distant). The cropped result is affected by the smaller sensor, but the lens is not affected. There is no magic, the cropped DX sensor is simply smaller.
The cropped DX sensor might have 24 megapixels, and yes, of course it retains those 24 megapixels.
The full FX image may have 24 megapixels too, but if cropped to the the same smaller size later (2/3 width and 2/3 height, because 1/1.5 is 2/3), it only has about 44% of its original pixels remaining then. So cropping later is the same "view", but the pixel count suffers seriously. Depending on use, may or may not have enough pixels remaining.
A 50mm DX is said to be an "Equivalent Focal Length" as 75 mm FX. The way we say this is misleading, causes confusion. It is only the FX body that sees the Equivalent focal length, not the DX body which only sees 50 mm. All it actually means is that the 50 mm DX body sees the same Field of View as a 75 mm FX body (assuming both at the same distance). These are different magnifications, since the FX sees the scene 36 mm wide, and the DX sees the same Field of View to be 24 mm wide. The FX body can duplicate that 50mm DX field of view either by using the 1.5x longer lens, or standing 1.5x closer. This is only because its sensor is 1.5x larger, so its field of view is larger.
So we imagine the DX lens as being somehow magically 1.5x longer than it actually is, like a 1.5x telephoto lens would see on a FX body, but only applies if we are comparing in terms of what FX sees. Not true if thinking about that the DX body sees. A 50 mm lens is always 50mm. And the smaller DX image does have to be enlarged half again more to be "equal" size of the FX image, so all things are still not equal.
Anyway, the point was, if the lens is projecting a 1:1 macro image, it is 1:1 regardless of which sensor is seeing it. The FX field of view will be 1.5x larger than the DX, since the DX image is a cropped smaller area of it, but the magnification is the same 1:1.
So the original point was that the 1:1 is itself a magnification, and there is not even this sense of Equivalent (field of view). If the lens is focused to 1:1, both sensors see 1:1. meaning the subject details are reproduced life size on the sensor. However at same magnification, now they see different fields of view (the DX is a smaller cropped field of view).