mirror of
https://github.com/LemmyNet/lemmy.git
synced 2024-11-14 00:14:03 +00:00
Merge pull request #1285 from Whayme/patch-1
Short intro explanation, reformatted as a table
This commit is contained in:
commit
7d7fe5962a
1 changed files with 8 additions and 7 deletions
|
@ -1,16 +1,17 @@
|
||||||
# Trending / Hot / Best Sorting algorithm
|
# Trending / Hot / Best Sorting algorithm
|
||||||
## Goals
|
|
||||||
|
An expected feature in link aggregators is a kind of "Trending" sort which shows users a mixture of new posts / comments and popular ones, making for a display order which highlights the most currently active parts of the site / thread. This keeps the experience fresh and makes sure the site stays moving. Various flaws can be found in the ways that popular link aggregators like Reddit have implemented "Hot" or "Trending" sorts, so Lemmy has its own algorithm.
|
||||||
|
|
||||||
|
## Goals and Considerations
|
||||||
- During the day, new posts and comments should be near the top, so they can be voted on.
|
- During the day, new posts and comments should be near the top, so they can be voted on.
|
||||||
- After a day or so, the time factor should go away.
|
- After a day or so, the time factor should go away.
|
||||||
- Use a log scale, since votes tend to snowball, and so the first 10 votes are just as important as the next hundred.
|
- Use a log scale, since votes tend to snowball, and so the first 10 votes are just as important as the next hundred.
|
||||||
|
|
||||||
## Reddit Sorting
|
| Reddit | Hacker News | Lemmy |
|
||||||
[Reddit's comment sorting algorithm](https://medium.com/hacking-and-gonzo/how-reddit-ranking-algorithms-work-ef111e33d0d9), the wilson confidence sort, is inadequate, because it completely ignores time. What ends up happening, especially in smaller subreddits, is that the early comments end up getting upvoted, and newer comments stay at the bottom, never to be seen. Research showed that nearly all top comments are just the [first ones posted.](https://minimaxir.com/2016/11/first-comment/)
|
|-|-|-|
|
||||||
|
| Does not take the lifetime of the thread into account, [giving early comments an overwhelming advantage over later ones,](https://minimaxir.com/2016/11/first-comment/) with the effect being even worse in small communities. New comments pool at the bottom of the thread, effectively killing off discussion and making each thread a race to comment early. This lowers the quality of conversation and rewards comments that are repetitive and spammy. | While far superior to Reddit's implementation for its decay of scores over time, [Hacker News' ranking algorithm](https://medium.com/hacking-and-gonzo/how-hacker-news-ranking-algorithm-works-1d9b0cf2c08d) does not use a logarithmic scale for scores. | Counterbalances the snowballing effect of votes over time with a logarithmic scale. Negates the inherent advantage of early comments while still ensuring that votes still matter in the long-term, not nuking older popular comments. |
|
||||||
|
|
||||||
## Hacker News Sorting
|
## Additional Details
|
||||||
The [Hacker New's ranking algorithm](https://medium.com/hacking-and-gonzo/how-hacker-news-ranking-algorithm-works-1d9b0cf2c08d) is great, but it doesn't use a log scale for the scores.
|
|
||||||
|
|
||||||
## My Algorithm
|
|
||||||
```
|
```
|
||||||
Rank = ScaleFactor * log(Max(1, 3 + Score)) / (Time + 2)^Gravity
|
Rank = ScaleFactor * log(Max(1, 3 + Score)) / (Time + 2)^Gravity
|
||||||
|
|
||||||
|
|
Loading…
Reference in a new issue