POC: HTTP reverse proxy cache #3610
Conversation
Thanks a lot for digging into this and documenting all of it! This has really helped me understand the Varnish cache system better already. Below I brainstormed a few questions which I'd like to discuss at a meeting.
Trying to give some (opinionated) early answers.
Difficult to say without measuring real live data. From a user perspective, initial data load after first login will always be a cache miss. Cache hits will be observed for:
However, this is from a user perspective. HTTP caching serves two "customers": the individual user and the server (or the overall application). So, equally important, the HTTP cache protects the server from excessive load.
Decreasing the load on the server in turn provides a faster experience for all the other cache misses. Normally, these two benefits align: a higher hit rate provides a faster user experience and at the same time decreases the load on the server. However, they don't always align (see the next question on response sizes), in which case a balance needs to be found between "best possible speed for each user" and "overall high load throughput". For me personally, the second benefit (server) even outweighs the first (individual user). Having caching functionality in the back of our hands would be a nice "weapon" to have once we decide to go out of beta and open the application to anyone. Minor point: We have some endpoints (like
Doesn't fit very well 😸 Caching and invalidation work better for smaller responses with less embedded data. I can see both options:
The same is true for other endpoints which embed a lot of data. We might decide to keep them as-is and accept that the larger ones become uncacheable. Or we might decide to reduce embedding. Increasing the number of network requests and reducing the embedding of data might be a disadvantage, though, for people on a high-latency network (just an assumption, would need to be tested). So this might be one of the tradeoffs between the user incentive and the server incentive.
By default, Varnish doesn't write any logs to disk, only to memory. So far I used … We should also add some headers in the response which indicate at least what is a cache hit and what is a miss. I've also seen snippets which enable this only if a specific debug flag is set, to avoid providing too much information to outsiders. For metrics: I haven't looked into this yet. There is a Prometheus exporter, which however doesn't seem to have seen much development lately.
My opinion (obviously to be discussed): on all deployments, so we see and test the same thing that is later pushed to production. Locally, I guess we want developers to be able to develop and test both with and without caching. Especially for development of backend functionality, you probably want to develop without the HTTP cache first and only sanity-check at the end that things work properly with caching enabled.
Currently (in this PR), cache expiration is set to 3600s/1h. I guess we could confidently live with much higher values, but this needs to be fine-tuned with real measurement (hit rate, cache memory size, etc.). I don't think users should & would need a cache purge functionality. This is something we need to implement correctly and not something users should understand and worry about. Maybe we want to have an admin functionality to purge the cache for troubleshooting cases. For deployments, my naive approach would be to ensure that Varnish is always restarted or purged with each new deployment.
This goes back to the user vs. server perspective. From a user perspective, the relative gain obviously depends a lot on their network. From South America, I currently see the following timings on the production server:
So a cached version of /camps would probably save me 200ms (approx. 50%) in this specific case. Results for someone with fiber in CH are hopefully different :-) It also depends on the endpoint. On my localhost (without Xdebug), most requests are in the range of 100-200ms, but I think I've also seen requests >1s on deployments during performance testing. Additional latency of the cache proxy: purely on the proxy side, negligible. Varnish is really fast, unless we implement something like preflight requests to the backend. From a server perspective, the performance increase directly correlates with the hit rate we can achieve.
Edit: I added a second example of camp-specific routes, for ScheduleEntry. I figured that the previous example with Category was too simple, because Category is a direct child of Camp. The example for ScheduleEntry is more generic and works with all resources of type …
Relevant commits: … This is also related to api-platform/core#5673.
Edit 2: This becomes a bit cleaner once api-platform/core#5732 is merged & released.
Sounds really cool, also good that you went far enough to find the first potholes. A possible approach would be:
4. Get some monitoring working in production (cache hits, misses, request duration on cache miss, cache purges, resource usage).
Another, maybe naive, approach would be to let the client include the campId in a header, i.e. "I am now navigating in camp xy, give me the cached responses of this camp".
Core Meeting Decision:
- 2 endpoints:
- Create a feature toggle
@pmattmann I addressed the bug we briefly discussed at the end of the meeting (see commit 3f604db), in case you want to have a look at it.
I don't know what improved the tests; I assume ae74c7b.
Looks good. I need a quick refresher on one point. The endpoint … Shouldn't this only be done for referenced link collections?
At the moment, all relations are included in the tag list, just to play it safe so we don't miss any edge case. As you identified correctly, in the case of … This is probably an area for later optimization (= a smaller list of cache tags). But we have to be careful here; we cannot include only collections. Most probably it's safe to remove ManyToOne relations (like …)
I tried to filter out the …
We would still need to figure out how we solve OneToOne relations, though. I'd prefer to keep this as a backup PR at the moment, and take it up as an optimization once we implement caching for the first resource with a OneToOne relation. OK for you?
👏👏👏 nice
This PR implements reverse proxy caching for the following endpoints:
The cache-hash includes the JWT cookie, so the cache is personal for each user (for each login/JWT, to be specific).
The cache is purged automatically for updates/deletes/creates that impact the cached responses. An xkey strategy is used for this, which deviates from the standard api-platform cache tag strategy.
This PR also includes the upgrade to api-platform 3.3. For simplicity of code review, it would make sense to merge #4942 beforehand.
To do:
To do after review / before merging:
Blocked by:
- need PR feat(serializer): collect cache tags using a TagCollector (api-platform/core#5758) and PR fix(serializer): fix TagCollector for JSONAPI and HAL format (api-platform/core#6076)
- compatibility with Symfony 7: Update to Symfony 7 (FriendsOfSymfony/FOSHttpCacheBundle#598)
See below for the original description of this PR.
This is an example POC for an HTTP cache in front of our API.
General
What's the purpose
An HTTP reverse proxy sits in front of the application and caches HTTP responses. Originally, this was mostly used for static content. However, with a smart invalidation mechanism in the application, HTTP caches can also be used for dynamic data.
Cache tags & surrogate keys
Most HTTP caches implement invalidation with surrogate keys (a specific implementation of cache tags).
This recording (Take your Http caching to the next level with xkey & Fastly) is a bit older (2018) but provides a good and simple overview of how cache tags work. The presentation is based on Varnish with xkey (= surrogate keys) and Symfony FOSHttpCache.
Implementation in api-platform/core
api-platform/core already includes an implementation for automatically adding cache tags and invalidating them. Out of the box, it supports Varnish & Souin (although implementing support for any other HTTP cache like Fastly, Cloudflare, etc. would be fairly easy).
In theory, cache tags could be any sort of string. In api-platform, however, they are implemented as IRIs. When cache invalidation is enabled, the default behaviour of api-platform is the following:
- Every response contains an additional HTTP header (cache-tags, xkey, etc., depending on the configuration) which references the IRIs of all entities included in the response (both embedded and linked entities). These tags are collected during the normalization process (in `$context['resources']`) and added to the HTTP response in AddTagsListener.
- An event listener is subscribed to changes on Doctrine entities (updates, inserts, deletions). If any such change is detected, a purge request is sent to the configured HTTP cache. This purge request includes the relevant cache tags (= IRIs) to purge.
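To make the first half concrete, here is a stripped-down sketch of the tagging mechanism. This is not api-platform's actual AddTagsListener; the `_resources` request attribute and the `xkey` header name are assumptions based on the description above and on the configuration used in this PR.

```php
<?php
// Sketch: write the IRIs collected during normalization into a single header
// that the reverse proxy stores alongside the cached response.

use Symfony\Component\EventDispatcher\EventSubscriberInterface;
use Symfony\Component\HttpKernel\Event\ResponseEvent;
use Symfony\Component\HttpKernel\KernelEvents;

final class AddCacheTagsListener implements EventSubscriberInterface
{
    public static function getSubscribedEvents(): array
    {
        return [KernelEvents::RESPONSE => 'onResponse'];
    }

    public function onResponse(ResponseEvent $event): void
    {
        // Assumption: the IRIs collected in $context['resources'] end up in a
        // request attribute ('_resources' is what api-platform uses internally).
        $iris = (array) $event->getRequest()->attributes->get('_resources', []);
        if ([] === $iris) {
            return;
        }

        // e.g. "xkey: /camps/1a2b3c /camps/1a2b3c/categories /categories/9f8e7d"
        $event->getResponse()->headers->set('xkey', implode(' ', $iris));
    }
}
```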
Souin vs. Varnish
Earlier, Varnish was included in the official api-platform/api-platform template. At the moment, however, neither Varnish nor Souin is included out of the box (the docs are outdated and still mention Varnish as included).
There are open PRs for the integration of both Varnish and Souin (api-platform/api-platform#2383).
From the discussion on api-platform, I got the impression that Souin is simpler and more modern, so I tried Souin first. I was not really happy, though: the documentation is very meager, and I had to look at Souin code and PRs multiple times to figure out how to use it. Finally, I struggled to include our JWT cookies in the cache key, so I gave up on Souin and switched to Varnish.
Hence, this PR includes Varnish. It might be a bit more complex initially, but the documentation is quite good, it is widely used and hence well proven, and the VCL language is really powerful and allows almost anything to be implemented.
This PR
This PR implements a simple setup of Varnish in front of our API.
How to test out
What is implemented
… (`return(pass)`)

Besides the simple setup, the following "advanced" features are implemented preliminarily:
The commit history is clean. So it might make sense to walk through the individual commits to understand the implementation (especially to see the changes made to PurgeHttpCacheListener).
Use cases & examples
Basic caching functionality
Basic invalidation/purge
Invalidation of collection
Cache scoped by JWT
Invalidation via CampCollaboration
Invalidation scoped by camp
(for an explanation of this functionality, read below "Frequent invalidation of collections")
Potential issues
Documentation of issues I ran into or potential issues I can see.
Too many surrogate keys (header too long)
Because each IRI is included in the response cache-tag header, this header can get really large; so large, in fact, that it exceeds the limit Varnish has configured for HTTP headers. On ecamp: try the /activities endpoints. Chances are the request will fail with a 500 due to this issue.
Others have run into this issue as well. In the linked issues, several options to remedy this are discussed.
Besides the options listed in the issue, a straightforward solution is to reduce the number of embedded entities (which could make sense once caching works).
A minimal fail-safe implementation would check the header size and, if it is too large, remove the cache tags and disable caching for this specific response (worst case, a response cannot be cached, but at least it doesn't result in a 500 error).
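A hedged sketch of such a fail-safe, written as a late-running Symfony response listener (names and the limit are illustrative; Varnish's actual limit is governed by its http_resp_hdr_len parameter, 8 kB by default):

```php
<?php
// Sketch: if the collected tag header grows beyond what Varnish accepts,
// drop it and mark the response as uncacheable instead of letting the
// request fail with a 500.

use Symfony\Component\EventDispatcher\EventSubscriberInterface;
use Symfony\Component\HttpKernel\Event\ResponseEvent;
use Symfony\Component\HttpKernel\KernelEvents;

final class CacheTagHeaderGuard implements EventSubscriberInterface
{
    // stay well below Varnish's default header length limit of 8 kB
    private const MAX_HEADER_LENGTH = 6000;

    public static function getSubscribedEvents(): array
    {
        // negative priority: run after the cache tags have been added
        return [KernelEvents::RESPONSE => ['onResponse', -255]];
    }

    public function onResponse(ResponseEvent $event): void
    {
        $response = $event->getResponse();
        $tags = $response->headers->get('xkey');

        if (null !== $tags && strlen($tags) > self::MAX_HEADER_LENGTH) {
            $response->headers->remove('xkey');
            $response->headers->set('Cache-Control', 'no-store, private');
        }
    }
}
```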
RelatedCollectionLink
The cache tags for an HTTP response are collected during normalization.
Due to the way we had to implement RelatedCollectionLinkNormalizer, too many cache tags are included in the response. This is related to the POC PR #3559, which would solve both the performance issues and the too-many-cache-tags issue.
API platform implementation of cache tags
The implementation in api-platform/core seems to work functionally, but in my opinion it is not optimal: way too many entities are purged during write operations.
The overall strategy of api-platform is to ensure responses are exact (= the cached response and the actual API response always need to match), sacrificing cache hit rate if necessary (see also this comment). This makes a lot of sense; however, I still think we could do better.
As an example, api-platform includes all IRIs in the cache tags, both from embedded and from linked entities. However, the purge behaviour for linked entities could and should be different from that for embedded entities.
I need to dig into this a bit deeper, though, and will check whether I include any improvements in this PR or open a PR directly against api-platform.
Edit: Opened a PR on api-platform for this
Frequent invalidation of collections
Every POST or DELETE operation will purge the collection resource. This is obviously necessary because the collection response now contains one entity more or less. Because query parameters are not part of the cache tags, this purges every variation of the collection endpoint.
Example: a POST on /activities to add a new activity to /camp/1 will purge every cached variation of the /activities collection, not only the ones belonging to camp 1.
For applications like a CMS, where most operations are reads and only a few users edit entities, this might work. For an application like ecamp, however, this would invalidate collection endpoints very frequently, to the extent that the cache hit rate on collection endpoints would be almost 0 once enough users are working and editing on the platform.
As most of our queries are directly or indirectly scoped to a specific camp, one solution could be to include the campID as a mandatory part of the cache tag. There could be various ways to achieve this. The most straightforward variant is to include the campID directly in the URI as a uriVariable (as this is already partially supported by api-platform).
In this PR there's an example implemented for the category endpoint (see the sketch below). Other ideas on how to solve this are highly welcome.
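For illustration, a minimal sketch of what such a camp-scoped route could look like with api-platform 3 attributes (names and options are illustrative and not necessarily the exact code of this PR; Camp is assumed to live in the same namespace):

```php
<?php
// Sketch: expose categories under a camp-scoped URI, so the camp id is part of
// the collection URI and therefore part of the collection's cache tag.

namespace App\Entity;

use ApiPlatform\Metadata\ApiResource;
use ApiPlatform\Metadata\GetCollection;
use ApiPlatform\Metadata\Link;

#[ApiResource(
    uriTemplate: '/camps/{campId}/categories',
    uriVariables: [
        'campId' => new Link(fromClass: Camp::class, toProperty: 'camp'),
    ],
    operations: [new GetCollection()],
)]
class Category
{
    // ...
}
```

A POST of a new Category would then only need to purge /camps/{campId}/categories for the affected camp, instead of every variation of a global /categories collection.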
Response depends on entities other than the ones included in the response body
If a response depends on entities other than the ones included in the normalization process, these entities have to be added to the cache tags manually. In our case, the most prominent example is access control to camp data through CampCollaboration entities. In most responses, the CampCollaboration is not part of the actual response. However, the CampCollaboration entity defines whether I have access to the camp data or not, hence the cache needs to be purged when the CampCollaboration entity changes.
In this PR, this is implemented for both security voters: the entity responsible for granting access to the resources is added to the cache tags (99c3723).
The other place in our code where responses vary depending on other entities is the Doctrine filters in the repositories (most prominently FiltersByCampCollaboration). This is not implemented yet and is closely related to the previous topic of frequent invalidation. Potential solutions:
Further development
Production readiness
This example PR is obviously not production ready. Besides the deployment implementation, the following resources contain some VCL code snippets that are worth reviewing and implementing where they make sense.
JWT parsing
Currently, the JWT cookies are included in the hash key (in the VCL), but no parsing of the JWT happens on the reverse proxy side. However, this could be implemented in Varnish, for example to validate the token's signature and expiry directly at the proxy, or to extract claims for use in the cache hash.
Resources for JWT parsing in Varnish (for my own documentation):
- Blog post, HS256, cookies: https://feryn.eu/blog/validating-json-web-tokens-in-varnish/
- Based on the previous blog post, RS256, Authorization header: https://github.com/opositatest/varnish-jwt
- Improved versions, supporting both HS256 and RS256:
  - https://stackoverflow.com/questions/70607615/varnish-how-to-check-jwt-signature-using-digest-vmod
  - https://code.uplex.de/uplex-varnish/libvmod-crypto
  - https://code.uplex.de/uplex-varnish/libvmod-frozen/tree/master/examples/jwt
Shared cache for a camp
As of this PR, users don't share cache data. However, within a camp, most if not all responses are identical between users who have read access to the camp. Hence, an idea to reduce cache size and increase the hit rate would be to use the campID as a hash key instead of the JWT cookie or userID.
Theoretically, this would be feasible if:
This is definitely not straightforward, so it's more of a "potential further development" at a later stage than part of an initial implementation.
Edit: I just found out that there is a second way to implement this without integrating camp claims into the JWT. FOSHttpCacheBundle has a feature called "User Context". In essence, this is a preflight request from Varnish to the backend/Symfony asking for a hash key. It seems like an elegant solution; however, it only makes sense if this preflight logic is really fast and the hit rate is relatively high. Otherwise we just add latency.
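A sketch of what a context provider for this could look like, assuming FOSHttpCacheBundle's user context feature (the ContextProvider interface and addParameter() are taken from the FOSHttpCache documentation; the camp lookup is hypothetical):

```php
<?php
// Sketch: users who can read the same set of camps get the same context hash
// and therefore share cached responses. Exact wiring depends on the bundle's
// user context configuration.

namespace App\HttpCache;

use FOS\HttpCache\UserContext\ContextProvider;
use FOS\HttpCache\UserContext\UserContext;
use Symfony\Bundle\SecurityBundle\Security;

final class CampMembershipContextProvider implements ContextProvider
{
    public function __construct(private Security $security)
    {
    }

    public function updateUserContext(UserContext $context): void
    {
        $user = $this->security->getUser();

        // Hypothetical helper below: the list of camp ids the user may read.
        $context->addParameter('camps', null === $user ? [] : $this->getCampIdsOfUser($user));
    }

    /** Hypothetical: would look up the user's CampCollaborations. */
    private function getCampIdsOfUser(object $user): array
    {
        return [];
    }
}
```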
FOSHttpCacheBundle
FOSHttpCacheBundle (which integrates FOSHttpCache into Symfony) is a Symfony package that supports adding cache tags to responses and purging tags via reverse proxy APIs (currently supporting Varnish, Symfony Cache, Cloudflare, and others).
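For reference, this is roughly how tagging and purging look when using the bundle directly (method names from the FOSHttpCacheBundle documentation; this PR uses api-platform's own cache tag mechanism instead, so this is not code from the PR):

```php
<?php
// Sketch: tag responses on read, invalidate the tags on write.

use FOS\HttpCache\ResponseTagger;
use FOS\HttpCacheBundle\CacheManager;

final class CategoryCacheExample
{
    public function __construct(
        private ResponseTagger $responseTagger,
        private CacheManager $cacheManager,
    ) {
    }

    public function onRead(): void
    {
        // tag the outgoing response with the IRIs it depends on
        $this->responseTagger->addTags(['/camps/1a2b3c', '/camps/1a2b3c/categories']);
    }

    public function onWrite(): void
    {
        // purge every cached response carrying one of these tags
        $this->cacheManager->invalidateTags(['/camps/1a2b3c/categories'])->flush();
    }
}
```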
This is the package used in the YouTube video linked earlier. The api-platform implementation is not based on FOSHttpCache (I don't know why). But as of today, the current implementation in api-platform is very difficult to extend without touching the actual code of api-platform.
Switching to FOSHttpCache might become necessary if we feel too limited by api-platform itself, which is what api-platform itself suggests (api-platform/core#952).