Hi Tony,
The best approach is to keep all users in the experiment, otherwise results will be biased. This provides the best apples-to-apples comparison w/r/t the following scenarios:
1. no users are eligible to receive treatment
2. all users are eligible to receive treatment
The first scenario will occur if you decide to rollback your feature. The second scenario will occur if you decide to ship your feature.
If only a small percentage of users are make it through the downstream service, your power may be low. If power is low, and the users filtered out by treatment are a subset of the users filtered out by control, then the following approach can increase power.
Create a segment of users that would be filtered out by treatment, regardless of whether they were assigned to control or treatment, and analyze this segment. You will want to check that the segment is 50/50 control/treatment. I understand that you may not be able to identify this segment, depending upon how your pipeline is set up, but if you can, it can increase power and provide unbiased results on the customers who would receive (not just be assigned) to treatment.
Sorry for the long answer, feel free to ask more questions!
Luke