Welcome to Codidact Meta!

Codidact Meta is the meta-discussion site for the Codidact community network and the Codidact software. Whether you have bug reports or feature requests, support questions or rule discussions that touch the whole network – this is the site for you.

Regular deletion (roomba) of content unlikely to ever be useful

+9
−0

I noticed that this network hosts publicly accessible questions that are unlikely to ever be very helpful, for example because they were marked as completely off-topic.

Examples

The content license does not oblige us to keep the content here, and removing such content would increase the signal-to-noise ratio. I don't see much sense in keeping this content. We could therefore permanently remove it from the system.

But there are also questions with a very low score (typically low quality, with missing information), often without answers, that aren't closed. With edits and answers they could potentially be turned into something useful, but that may not be very likely. The decision to keep or remove such content might be a bit more difficult.

However, cleaning up more regularly might also increase the appeal of the front pages of the individual sites (see recent discussion).

Should we regularly remove content we deem to be not useful at all?

If yes, what should the criteria be (close status, score, number of answers, age)? Should it be done automatically (automatic cleaning robots) or manually (at least for now)? Where should the criteria be decided (for each community individually or network-wide)?

Searched for it on Meta but couldn't find anything for "automatic deletion".


0 comment threads

4 answers

+1
−0

I propose automatically removing bad questions after a given grace period

But this requires a definition of a bad question.

What's a bad question?

While that seems like a very subjective question, that doesn't mean it can't be answered somewhat objectively.

  • initially, it's any question that shouldn't be in a community according to its users

  • which in practice is any question with a very low score (like all 3 examples in the original question)

  • a low score may be defined as a score in the lowest 5% of scores globally or within a community, or simply as a constant LS, so that we can have something like

if(question.score < LS)
    question.init_removal_process()
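
As a rough illustration of the two ways to pick LS mentioned above, here is a minimal Python sketch. The function names, the 5% default, and the choice to treat ties leniently are all assumptions made for this example; nothing here reflects how Codidact actually computes anything.

```python
def low_score_threshold(scores, percent=5):
    """Derive LS from existing scores (hypothetical helper).

    LS is the first score just outside the lowest `percent`% of posts,
    so the strict `score < LS` check only catches posts inside that group.
    """
    if not scores:
        raise ValueError("need at least one score")
    ordered = sorted(scores)
    # Number of posts that make up the lowest `percent`% (at least one).
    count = max(1, len(ordered) * percent // 100)
    # Clamp so a single-post community never raises an IndexError.
    return ordered[min(count, len(ordered) - 1)]


def should_init_removal(score, ls):
    # Mirrors the `question.score < LS` check from the answer.
    return score < ls
```

A constant LS skips `low_score_threshold` entirely and passes a fixed number as `ls`. Because the comparison is strict, posts tied with the threshold are kept, which errs on the lenient side.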

We may then:

  • add other factors that can initiate the removal process,

  • replace question.score = up_votes - down_votes with a hidden score for the question's value that takes into account how many people it reached, gives more weight to upvotes than downvotes, checks whether it has answers, etc.

  • make the process a bit less robotic by notifying the poster and/or a community moderator some time before the post is removed
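
The hidden score idea from the list above could look something like the following sketch. The field names, the use of view counts as a stand-in for "reach", and every weight are assumptions chosen only to make the example concrete.

```python
from dataclasses import dataclass


@dataclass
class QuestionStats:
    up_votes: int
    down_votes: int
    answers: int
    views: int  # assumed stand-in for how many people the question reached


def hidden_score(q, up_weight=2.0, down_weight=1.0,
                 answer_bonus=1.0, views_per_point=100):
    """Weighted alternative to up_votes - down_votes (all weights are guesses)."""
    # Upvotes count for more than downvotes.
    score = up_weight * q.up_votes - down_weight * q.down_votes
    # Having at least one answer earns a flat bonus.
    if q.answers > 0:
        score += answer_bonus
    # Reach adds a small amount: one point per `views_per_point` views.
    score += q.views / views_per_point
    return score
```

With these weights, a question with one upvote, two downvotes, and 50 views has a plain score of -1 but a hidden score of 0.5, so it would be treated more leniently than the visible score suggests.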


Note 1: By removing I mean making it invisible, unindexable and unreachable (which can be done as a soft deletion).

Note 2: "Bad" is very subjective, and a badly posed or badly received question may be improved, or perceived differently by others. So whatever definition we choose, it should be flexible and lenient.

Note 3: Among the 3 example questions, there is only one that I wholeheartedly agree shouldn't be on a serious Q&A website, even before opening it, and it's the one with the lowest score.


0 comment threads

+7
−0

I agree that automatic deletion (1) would be a useful way to reduce clutter and manual moderation. I would propose implementing something along the lines of the draft below.

(1) Deleted as in "not displayed on the site". The technical definition of what gets archived and what gets deleted from the actual DB is a discussion for another post.


A post is automatically deleted one month after the last edit to its question or any of its answers, if it meets any of the following:

  • Closed posts without answers.
  • Closed posts where both the question and all answers have negative score.
  • Questions with a score of -3 or lower that have no answers, or whose answers all have a score of -3 or lower as well.

All the details can obviously get fine-tuned and more rules can be added as we come up with them.
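
To make the draft rules above easier to discuss, here is a minimal Python sketch of them. The class and function names are hypothetical, "1 month" is assumed to mean 30 days, and "-3 or more" is read as "a score of -3 or lower"; none of this is taken from the actual Codidact code base.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta

GRACE_PERIOD = timedelta(days=30)  # assumption: "1 month" taken as 30 days
LOW_SCORE = -3                     # threshold from the draft rules


@dataclass
class Post:
    score: int
    last_edit: datetime


@dataclass
class Question(Post):
    closed: bool = False
    answers: list = field(default_factory=list)  # list of Post objects


def last_activity(q):
    # Latest edit across the question and all of its answers.
    return max([q.last_edit] + [a.last_edit for a in q.answers])


def matches_rule(q):
    answer_scores = [a.score for a in q.answers]
    # Rule 1: closed posts without answers.
    if q.closed and not answer_scores:
        return True
    # Rule 2: closed posts where the question and all answers score below zero.
    if q.closed and q.score < 0 and all(s < 0 for s in answer_scores):
        return True
    # Rule 3: question at -3 or lower, with no answers or only answers at
    # -3 or lower (all() is True for an empty list, covering "no answers").
    return q.score <= LOW_SCORE and all(s <= LOW_SCORE for s in answer_scores)


def should_auto_delete(q, now):
    return now - last_activity(q) >= GRACE_PERIOD and matches_rule(q)
```

Note that any edit to the question or one of its answers restarts the clock, so a post that is actively being improved is never caught by the check.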


3 comment threads

And until the feature becomes implemented... (11 comments)
Strict -3, or a Wilson score threshold? (4 comments)
What about duplicates? (5 comments)
+2
−0

Yes, we need to clean things up

For a Q&A site to come across as useful, the chaff simply needs to be separated from the wheat. We can't expect external search engines to care about our internal quality metrics; at best we can get them to care about how many times a post is linked internally, and even that's pretty wonky. If search results for things on Codidact turn up bad questions, that makes the site look bad - even worse than if it fails to turn up relevant questions.

But maybe don't delete things aggressively

It might be better to shuffle things off automatically to a different category - as long as it's done in a way that search engines can meaningfully care about. It's good if people get (correctly) the impression that Codidact communities can offer more individualized help as well as putting forward a high-quality canon, without mixing the two. That's the sort of thing that makes a community feel like a community: being both caring and goal-oriented.

If things that aren't in the main Q&A section can be held to different (lower) standards, that offers the opportunity for a more welcoming experience for everyone. Instead of having to filter tons of junk out of the place where properly-formed Q&A is presented, curators can cherry-pick from an explicit, more-help-desk-like pile to see what issues are common, create canonical Q&A, and then either start closing duplicates or at least providing a reference that answer-writers can use to shorten their answers (dramatically, in most cases). People who have a problem and really want a forum experience (or debugging help) don't have to look elsewhere (although I imagine we would still have considerably stricter requirements than most actual discussion forums), and people who like answering those kinds of questions can take advantage of an environment where the question is fixed in-place at the top of the page.

However, for this to work, it needs to be easy - ideally, even the default - for web searches to land people in the top-quality content. Making this work properly might involve using subdomains carefully.

What's "bad"?

I've remarked before that it's not very useful that a "needs author's attention" flag simply redirects people to the comment interface. As has been proven to me time and time again in a Specific Outside context along with many other Similar Experiences, there are all kinds of people out there champing at the bit to be "helpful" with barely comprehensible beginner ramblings, making herculean efforts to read OP's mind and interpret a question when nothing has been clearly asked - and the availability of AI has only made it exponentially worse.

In the long run, expecting multiple people to act to close a bad question before anyone can try to offer answers that clog the site and drag down average quality... is simply not scalable. Not even if you have multiple clearly-defined reasons for closure and a whole subsection of Meta dedicated to explaining the how and why of it. By my reading, "needs author's attention" is largely a catch-all for questions that:

  • are unclear (whether because they are written in a language the community doesn't want to support, are seeking to do something that doesn't make any sense to attempt, don't clearly define a recognizable problem...)

  • are missing specific crucial pieces of information (this can cause the perception of being "unclear", but is really a separate issue)

  • are trying to ask too much

  • reflect an idiosyncratic issue that prevents the question from contributing to a Q&A canon that can help others (whether it's a "typo", logical oversight or whatever is equivalent to that for a given community; or is just otherwise too specific to an individual's exact setup)

  • are subjective in a way that doesn't work well in the Q&A format

My assessment is:

  • This is missing the problem of questions simply being blatantly off topic. If I want to flag such a post, it seems like I'm expected to use the "other" reason, and I think that's absurd. Even if it's only subtly off topic, it might easily be recognized as irreparably off topic.

  • The idiosyncratic issues, on the other hand, are the exact sort of thing that I can imagine being permitted in a separate category. For example, if there are a lot of people who want to debug code, a lot of people who want code debugged, and a lot of potential to "farm" good Q&A out of that... I don't want to keep those people out of the Software community, I just want to keep their "questions" and "answers" separate from the actual Q&A.

  • Aside from those two points: the key metric for a "bad question" is that it "needs the author's attention" and doesn't get that attention. Aside from blatantly off-topic content, these are the questions I'd like to see actually deleted. Even in an explicit "tech support" context, there's no point in keeping a question around if answering it requires input from one specific person and that person isn't forthcoming.

  • On the other hand, the lack of such information is a reason to try to keep the question "out of the way" - in a place where curators can easily find it, but answer-writers have more difficulty - until it's provided. That is, to my understanding, the primary reason for questions to have a "closed" status. (I would like for default views to filter out such questions, while making it easy to include or even focus on them.)

We can adjust policy by site volume

As it stands, we get so few questions, primarily from a community of relatively skilled enthusiasts, that anything that doesn't measure up to standards sticks out like a sore thumb (and there are plenty of resources to deal with the problem).

If Codidact communities are to grow in the way that we seem to hope they will, we must accept that this will not always be the case. Any discussion like what we're having here is inherently forward-looking. I can see a path that leads all the way to "new questions start out closed or in a separate section, and must be workshopped, categorized (as potential canonical material vs. yet another 'support ticket') and approved before they can be published to receive answers".


1 comment thread

P.S. (1 comment)
+1
−1

I personally like the way you are contributing (apart from voting, re-editing posts, and some other things). You may have seen somewhere that only two people are working as coders in the Codidact organization, so it takes a long time to build things. I believe Ruby on Rails is a bit hard, which is why most people can't contribute code. I was also a contributor. What I took away from working on the project is that maybe I can't understand how one page is linked to another in Ruby on Rails. One of my friends also said that after QPixel he hates Ruby on Rails (not sure if I phrased that correctly). I mention this because you were asking for automatic deletion.

Everyone thinks that deleting poor questions may help speed up the growth of the community. But a staff member (it was more than 3 months ago, so I don't remember whether it was staff or someone else) said, "Removing all imported content would be helpful, because some of our communities are being dragged down by copyright infringement, but what can we do if someone has edited a post and put some effort into it?"

"We could therefore permanently remove it from the system."

Permanently removing something from the database is never helpful: there is a report on Writing of a problem caused by a "data error". We can remove things softly, the way we do now.

Usually, the bad posts are all posted by a few specific users. When we mention them in a comment, they don't reply to our messages. I believe we should have a discussion with them. It's OK not to know things, but deleting their content without informing them isn't a good idea, so I always suggest talking to those people. I sometimes worry about why they don't want to listen to us.

I once asked on Discord when a moderator removes the Participate and Participate Everywhere abilities, but no one replied. If we revoke those abilities, though, those users might come to us to discuss things (I don't know; I don't even think anyone will listen to me, but I don't care and say these things anyway). Some may say we should suspend them; I explained in the answer why suspending is never helpful.


2 comment threads

Workload of devs is irrelevant (4 comments)
Revoking or suspending abilities (1 comment)
