# experimentation
m
šŸ› ā“ Hey there, we just stopped an experiment and made Variation 1 auto-rollout. I saw the correct state on the feature flag and on the experiment. Both looked publish and correct. However, when we skipped the cache on init and then asked for if the flag was on, we got:
on: false, source: 'defaultValue'
I would not expect to see the feature flag's default value when the experiment rule should be serving true.
āœ… 1
PS. The old revision is because we disabled the experiment rule and changed the default value instead to get the rollout we wanted.
h
However, when we skipped the cache on init and then asked if the flag was on
Did you set a uniqueId when you did this test?
Sorry for the slow response here (got lost in the holiday sauce!)
The temp rollout basically takes everyone that would get into the experiment and instead shows them one value. A unit only "should" get into the experiment if they have a uniqueId set.
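Rough sketch of what I mean, in case it helps (assuming uniqueId is the hash attribute on that flag; host/key are placeholders):

import { GrowthBook } from "@growthbook/growthbook";

// The rollout rule hashes on uniqueId, so the attribute has to be set before
// the feature is evaluated or the rule is skipped entirely.
const growthbook = new GrowthBook({
  apiHost: "https://cdn.growthbook.io", // placeholder
  clientKey: "sdk-xxxx",                // placeholder
  attributes: { uniqueId: "some-stable-id" },
});

await growthbook.init();
console.log(growthbook.evalFeature("testing"));
// expect on: true here rather than a fall-through to the default value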
m
Sorry for the slow response here (got lost in the holiday sauce!)
No worries at all!
Did you set a uniqueId when you did this test?
Yes, we have been running the code in prod for a bit and have been monitoring it against an A/A test of the same code. For some clarity, we have successfully used the rollout once before, but for some reason in this case we did not see what we expected. PS. I am not exactly sure if a rollout will have an experiment/experiment result when asking about a feature. I reached out mainly because I could repro it at the time in a code sandbox, ignoring our more complicated fanout (Cloudfront -> Vercel / Edge / NextJS), each of which has its own caching layer. The code sandbox was simply init'ing GB and inspecting the getFeatures results.
h
I am not exactly sure if a rollout will have an experiment/experiment result when asking about a feature.
I think it should.
Was the experiment still in the sdk payload when it was in the temp rollout state? I asked one of our eng and he agreed that it should function like an experiment but everyone will get the same value.
m
The logging shown above is us adding a custom logger, but we do print the entire result object, i.e. the image above shows no experiment running when the rollout was active.
Was the experiment still in the sdk payload when it was in the temp rollout state?
Not that I saw at the time šŸ˜“
h
Got it. Understood.
I believe it should be from my understanding, let me follow up with the team here.
m
I can also poke at the source. I've read most of the JS/React code, but I have not looked at your server's code.
Basically our steps to repro:
• Feature flag - default off
• Add A/B test
• Stop experiment with B winning and roll out B
h
fyi: I think an easier way to validate that this is or is not working would be to use the feature tester at the bottom of the feature page.
basically simulate a user who normally would have been in the experiment, change the experiment to a temp rollout, and validate if they still get the feature value corresponding to the rolled out variation
once it's a temporary rollout, it's no longer an "experiment" in the way it presents in the SDK. it actually materializes as a forced rollout rule.
example
prior to the temporary rollout, i am bucketed normally
(trying to rule out whether this is an SDK-level problem, caching problem, feature flag/experiment setup problem, or a bug on our end)
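fwiw, once rolled out the flag ends up looking roughly like this in the SDK payload (shape approximate, not your exact flag):

"testing": {
  "defaultValue": false,
  "rules": [
    { "coverage": 1, "hashAttribute": "uniqueId", "force": true }
  ]
}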
m
Okay I found the issue: https://codesandbox.io/p/sandbox/happy-elgamal-55y52n Basically, due to limitations with Vercel's CDN caching, we have been doing the computation of the experiments a user would see at the edge. This allows us to split our CDN caching based only on the available number of experiment buckets. However, this limitation also forces us to not pass user-specific attributes (user id, unique id, etc) from the edge down to the main app. Thus, when we rebuild a GrowthBook instance on the server we are not passing the uniqueId attribute and the result is:
Skip rule because user not included in rollout {id: 'testing', rule: Object}
Use default value {id: 'testing', value: false}
We have been aware that our implementation is non-standard, at least according to your examples. But this was mainly so we can leverage the CDN cache as much as possible to limit our Vercel costs.
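For reference, the server-side rebuild is roughly this (payloadFromEdge is our own plumbing, not an SDK call):

import { GrowthBook } from "@growthbook/growthbook";

// The edge strips user-specific attributes before the request reaches the app
// server, so this instance never sees a uniqueId and the rollout rule is skipped.
// `payloadFromEdge` is the features JSON we fetch and cache at the edge ourselves.
const gb = new GrowthBook({
  features: payloadFromEdge.features,
  attributes: {}, // no uniqueId available here
});

console.log(gb.evalFeature("testing"));
// -> on: false, source: 'defaultValue' instead of the rolled-out true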
h
yeah unfortunately BE/Edge experimentation on user attributes + cached responses doesn't often mix
m
I think the "bug" from my point of view is that I forced an experiment value in the code, but that code is never executed because the skip rule is hit first
I guess I could set a hardcoded uniqueId attribute on the server for all users. It would never be used since we'd force the feature for every user
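i.e. roughly this (same placeholder plumbing as my snippet above, untested):

// Workaround idea: give every server-side instance the same dummy uniqueId so the
// rollout rule isn't skipped; the rolled-out value is the same for everyone anyway.
const gb = new GrowthBook({
  features: payloadFromEdge.features,               // same edge-fetched features as above
  attributes: { uniqueId: "server-rollout-dummy" }, // hardcoded placeholder
});

gb.evalFeature("testing"); // now hits the forced rollout rule instead of the default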
h
I see. Side question: how do your tracking calls work when doing edge evaluation?
m
Can I send some files over directly to you?
h
sure
m
I'll send all of it over to you, but the main bit you are asking about is attached.
I am unhappy that we had to mimic a lot of the internals of the JS source, but it does work great!
I ended up doing more testing and I am noticing something. If I console.log(growthbook.evalFeature("testing")); there is no experiment in the evalFeature's result. My expectation was that there would still be an experiment and result. Re: https://growthbookusers.slack.com/archives/C07E4HA06MD/p1733338164984769?thread_ts=1732742454.275509&cid=C07E4HA06MD It is in the payload, but not returned by evalFeature in the same shape as a running experiment, which breaks the logic I've shown you.
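The check on our side that breaks is roughly this (simplified):

const result = growthbook.evalFeature("testing");

if (result.experiment && result.experimentResult) {
  // running A/B test: we read the assigned variation from result.experimentResult
} else {
  // temp rollout: result.source is 'force' with no experiment attached,
  // so we fall through here and never record the variation
}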
h
I believe I misspoke about it still being an experiment in the payload. It was based on a misunderstanding of how this works.
As Bryce says above, it actually materializes as a forced rollout rule.
šŸ‘ 1
m
Thanks guys. I really appreciate all the help. I told Bryce that I likely will stew on our implementation and see what changes I can suggest that would allow our use case to work out of the box. Overall, we love the product and know our CDN caching and computing at the edge is not the use case the product was built for, but I'm sure others out there will want to try the same. Basically, the only things we fight are very small behavior differences where we made an assumption. ...Like in this case, our code assumes any experiment, even one which is rolled out, will have an experiment and result object after eval.
anyways, nothing for you all to do unless you think it's reasonable to keep the experiment behavior on eval
h
We're actually speaking internally about ways we can either make edge/hybrid testing easier or be more prescriptive with better docs/examples. If you have any ideas, we'd love to hear them
m
more prescriptive
That would have helped a lot. We started the journey back in Feb. We were on the App Router, but there was no example at the time. Once we determined we wanted to run at the edge for caching reasons, it took a lot of trial and error to get here.
The main thing we do is pre-compute the buckets a user may see at the edge. Then, if we ask about a feature, we use the pre-computed value. On the server, this requires us to call setForcedVariations to bypass the experiment logic, since we do not have the hash attribute available there. This works as long as we run the experiment logic. Since the rollout's feature eval is not actually computed as an experiment, we are failing to set the variation. I would use getExperiments and then just ask for the value of each. However, that returns the experiments from the payload, which is empty even when we do have experiment rules:
{
    "status": 200,
    "features": {
        "product-overview-reorder-sections": {
            "defaultValue": false,
            "rules": [
                {
                    "coverage": 1,
                    "hashAttribute": "uniqueId",
                    "seed": "647ef094-a7c6-4dcc-adf4-d1c216286e6e",
                    "hashVersion": 2,
                    "variations": [
                        false,
                        true
                    ],
                    "weights": [
                        0.5,
                        0.5
                    ],
                    "key": "product-overview-reorder-sections",
                    "meta": [
                        {
                            "key": "0",
                            "name": "Description First"
                        },
                        {
                            "key": "1",
                            "name": "Compatibility First"
                        }
                    ],
                    "phase": "0",
                    "name": "Product Overview Reorder Sections"
                }
            ]
        },
        "testing": {
            "defaultValue": false,
            "rules": [
                {
                    "coverage": 1,
                    "hashAttribute": "uniqueId",
                    "seed": "3554bf39-d234-42a9-8a8f-93766d6203b8",
                    "hashVersion": 2,
                    "force": true
                }
            ]
        }
    },
    "experiments": [],
    "dateUpdated": "2024-12-04T18:54:47.700Z"
}
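For context, the server side of that pre-computation looks roughly like this (precomputed and payloadFromEdge are our own plumbing, not SDK calls):

// `precomputed` is built at the edge, e.g. { "product-overview-reorder-sections": 1 }
// (experiment key -> variation index). Forcing the variation lets the server skip
// hashing on uniqueId, which it never receives.
const gb = new GrowthBook({ features: payloadFromEdge.features });
gb.setForcedVariations(precomputed);

gb.evalFeature("product-overview-reorder-sections"); // works: the rule is an experiment
gb.evalFeature("testing"); // misses: the rollout rule has no experiment to force, so it falls back to the default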
h
fyi: getExperiments is for "auto experiments" (i.e. redirects and visual editor exps)
šŸ‘ 1
m
I figured that out eventually, but it is not obvious when you are first using the SDK
My ideas:
• something like getExperimentsFromPayload or a computeExperimentsValues which accounts for rollout experiments
• An example of computing the experiments at the edge and passing them into a Next.js app
ā—¦ In our case we rewrite the URL to add the computedExperiments in a path and then parse it server-side
ā–ŖļøŽ This allows the CDN to cache server responses for each possible arrangement of values
šŸ‘€ 1
We are currently not handling forced experiments at the edge, hence the issue with the rollout
Even now, an example of how you all would tackle the Vercel caching and use the edge would go a long way.
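To make the first idea above concrete, something shaped like this is what I mean (just a sketch; field names taken from the payload I pasted, and getExperimentsFromPayload is only my suggested name, not an existing SDK API):

// Returns running experiment rules and rolled-out (forced) rules in one place
// so our edge code can pre-compute both instead of missing the rollout case.
function getExperimentsFromPayload(features) {
  const entries = [];
  for (const [featureKey, def] of Object.entries(features)) {
    for (const rule of def.rules || []) {
      if (Array.isArray(rule.variations)) {
        entries.push({ featureKey, type: "experiment", rule });
      } else if ("force" in rule) {
        entries.push({ featureKey, type: "rollout", value: rule.force, rule });
      }
    }
  }
  return entries;
}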