Begin with fast checks that explain behavior, then add slower, heavier checks only when they reduce risk.
大多数人可能认为全面的评估应该从一开始就包含所有可能的检查,但作者主张渐进式评估方法,从快速检查开始,只在必要时添加更复杂的检查。这个观点挑战了'一次性全面测试'的常规做法,主张风险驱动的评估策略。
Begin with fast checks that explain behavior, then add slower, heavier checks only when they reduce risk.
大多数人可能认为全面的评估应该从一开始就包含所有可能的检查,但作者主张渐进式评估方法,从快速检查开始,只在必要时添加更复杂的检查。这个观点挑战了'一次性全面测试'的常规做法,主张风险驱动的评估策略。
A common piece of advice for working with AI coding tools is to simply write more tests because if the tests pass, the code is fine.
大多数人认为只要测试通过,代码就是好的,但作者指出过度编辑问题使得测试难以全面评估代码质量。
一个独立的本地 HTTP 服务器,模拟 𝕏 API v2 的行为,带交互式 Web UI。可以在不消耗真实 API 额度的情况下测试代码逻辑。
令人惊讶的是:𝕏提供了本地API模拟器'Playground',允许开发者在不消耗实际API额度的情况下测试代码,这种做法在大型API提供商中并不常见。它不仅降低了开发成本,还提高了开发效率,显示出𝕏对开发者体验的重视程度超出了行业平均水平。
feat(benchmarks): add screenshot-based evaluator, screenshot collector, and --parallelize flag - Add screenshot-based LLM judge evaluator (evaluator.ts) - Add ScreenshotCollector for capturing browser screenshots during runs
令人惊讶的是:这个项目包含一个基于截图的评估系统,使用LLM作为评判员来评估自动化任务的结果。它能够捕获浏览器截图并在运行过程中收集这些视觉数据,这为网页自动化任务提供了一种全新的评估方式,超越了传统的文本比较方法。
but would fail recognize that the feature didn't work end-to-end
这揭示了Agent在认知上的盲区:它容易陷入“代码视角”的自证预言,以为单元测试通过就等于功能完整。引入端到端浏览器自动化测试,是强迫Agent站在“用户视角”去验证,这是从开发者思维向产品思维跨越的关键。
This class of bug is insidious because it evades every layer of defense. It will not be caught in development testing — who runs a test for 50 days? It will not be flagged in code review — the logic looks perfectly reasonable.
大多数人认为代码审查和测试能捕获大多数系统性缺陷,但作者认为这个bug的特殊性使其能够逃避所有常规检测手段。这挑战了软件质量保证的基本假设,暗示某些缺陷只有在极端条件下才会显现,而常规开发流程无法覆盖这些场景。
a clockwork watch may be ticking normally, such that it’s very hard to tell that it is likely to break down next month, but opening up the watch and looking inside can reveal mechanical weaknesses that allow you to figure it out.
for - AI - progress trap - interpretability testing - deception - Is it possible that AI could even change their node behavior as a deceptive move?
similar to how neuroscientists study animal brains by correlating measurement and intervention to external stimuli and behavior
for - AI - testing - interpretability - neuroscience techniques
c'est aujourd'hui dans les services départementaux que ceux qui 00:54:14 se déclarent mineurs se heurent aux obstacles les plus insurmontables il arrive qu'il soit rejeté dès la demande de rendez-vous par l'agent administratif chargé de l'accueil qui simplement dit au jeunes qu' n'est pas mineur sur sa seu 00:54:27 ne lui laissant même pas la possibilité de déposer un dossier toutefois le plus souvent c'est l'évaluation proprement dite de minorités qui s'avère l'étape infranchissable cette évaluation est souvent faite par des personnels sans 00:54:38 formation spécifique et soumise à la pression politique du Conseil départemental dont les élus s'inquiètent des dépenses supplémentaires occasionnées par la prise en charge des mineurs non accompagnés et ce bien qu'un plan national a été mise en place pour 00:54:51 répartir ces derniers sur tout le territoire
l'enquête que j'ai conduite pendant 5 ans avec annecler de faossé 00:53:00 dans les Hautes Alpes près de la frontière italienne dans une région qui est autour du col de Montgenèvre l'un des principaux points d'entrée en France par le sud-est l'un des deux points d'entrée en 00:53:11 en France par le sud-est montre que le protocole décrit dans les textes n'est généralement pas suivi alors même que depuis 2016 la proportion des jeunes garçons en provenance d'Afrique sub-saharienne se déclarant mineur a 00:53:24 souvent été élevée alors frontière d'abord on a eu pendant plusieurs années un refoulement presque systématique par les policiers chargés de la garder qui contestait l'âge déclaré par le jeune voire déchirer son acte de naissance 00:53:38 jusqu'à ce que plusieurs condamnations de l'État par les tribunaux administratifs ne conduisent ce dernier à se montrer plus respectueux de la loi en adressant ses garçons à l'aide sociale à l'enfance via une association 00:53:50 locale ce qui est désormais fait le plus souvent certains agents récal Citran continuant CEP pendant à les renvoyer en Italie parfois après avoir falsifié leurs documents ainsi que le constatent 00:54:02 les associations qui de l'autre côté de la frontière côté italien conserve les copies des originaux
Those who boast of such high cognition should not keep it backbut should present it publicly to be tested and esteemed.
When writing component tests that involve two-way bindings, context or snippet props, it’s best to create a wrapper component for your specific test and interact with that. @testing-library/svelte contains some examples.
Picasso tadpoles
for - adjacency - intelligence - testing William James definition of inteligence - Picasso tadpole - artificially mixed up initial tadpole embryo state - to normal frog state - collective intelligence has problem-soving ability that chooses a different pathway to achieve the same goal
Vitest makes writing tests directly within your source code easy, eliminating the need for separate test files. This approach, known as in-source testing, is useful when you want to quickly test individual functions without the overhead of creating and managing separate test files.
Documents
This is an example Hypothesis annotation. It depends on selecting text. It can be set to be entirely public, or restricted to a private Group.
Interact with live systems whenever feasible instead of mocking components to uncover potential integration issues.
Today I realized that the pursuit of knowledge is a lifelong journey. It’s important to question what we know to avoid biases.
Let me know if you see this
Enhances ActionMailer to support the :cache delivery method, which behaves like :test, except that the deliveries are marshalled to a temporary cache file, thus making them available to other processes.
There are two things that I enjoy: a test framework written in my own Domain Specific Language (DSL) that is easily understood by all those on a project and the ability for all participants to easily read, search, and view the tests.
Disable all observers in your test suite by default. They should not be complicating your model tests because they should have separate concerns anyway. You don't need to unit test that observers actually fire, because ActiveRecord's test suite does that, and your integration tests will cover it.
I emphatically disagree with BlueFish about observers being difficult to properly unit test. This is precisely the biggest point that distinguishes them from lifecycle callbacks: you can test observers in isolation, and doing so discourages you from falling into many of the state- and order-heavy design pitfalls BlueFish refers to (which again I think is more often true of lifecycle callbacks).
Preface xiIntroduction
Hello! Can you see this annotation on a PDF?
flip to the acknowledgments in cookbooks to see if testers are thanked.
This is the most simulative version of a controller. It will try and mimic real user behaviour. It's the recommended version to use when the goal of the load-test is finding out how many concurrently active users the target instance supports.
How to test a typewriter before buying by [[Retrotype]]
As of right now the full build takes over an hour to run, and this makes cycling for PRs and quick iterative development very difficult.
Virginia
Hello1
l'enquête que j'ai conduite pendant 5 ans avec annecler de faossé 00:53:00 dans les Hautes Alpes près de la frontière italienne dans une région qui est autour du col de Montgenèvre l'un des principaux points d'entrée en France par le sud-est l'un des deux points d'entrée en 00:53:11 en France par le sud-est montre que le protocole décrit dans les textes n'est généralement pas suivi alors même que depuis 2016 la proportion des jeunes garçons en provenance d'Afrique sub-saharienne se déclarant mineur a 00:53:24 souvent été élevée alors frontière d'abord on a eu pendant plusieurs années un refoulement presque systématique par les policiers chargés de la garder qui contestait l'âge déclaré par le jeune voire déchirer son acte de naissance 00:53:38 jusqu'à ce que plusieurs condamnations de l'État par les tribunaux administratifs ne conduisent ce dernier à se montrer plus respectueux de la loi en adressant ses garçons à l'aide sociale à l'enfance via une association 00:53:50 locale ce qui est désormais fait le plus souvent certains agents récal Citran continuant CEP pendant à les renvoyer en Italie parfois après avoir falsifié leurs documents ainsi que le constatent 00:54:02 les associations qui de l'autre côté de la frontière côté italien conserve les copies des originaux
c'est aujourd'hui dans les services départementaux que ceux qui 00:54:14 se déclarent mineurs se heurent aux obstacles les plus insurmontables il arrive qu'il soit rejeté dès la demande de rendez-vous par l'agent administratif chargé de l'accueil qui simplement dit au jeunes qu' n'est pas mineur sur sa seu 00:54:27 ne lui laissant même pas la possibilité de déposer un dossier toutefois le plus souvent c'est l'évaluation proprement dite de minorités qui s'avère l'étape infranchissable cette évaluation est souvent faite par des personnels sans 00:54:38 formation spécifique et soumise à la pression politique du Conseil départemental dont les élus s'inquiètent des dépenses supplémentaires occasionnées par la prise en charge des mineurs non accompagnés et ce bien qu'un plan national a été mise en place pour 00:54:51 répartir ces derniers sur tout le territoire
if your treatments are ordered, don't compare each mean with each other mean (multiple comparisons), instead do one test for trend to ask if the outcome is linearly related with treatment number
How do you do hypothesis testing for trends for an ordered categorical variable?
Could you convert x to numbers (1,2,3) and run a linear regression y ~ x? or even categorical ordered variables can be linearly regressed?
cognitive skills. Therefore, keeping the designs clean and safe from clutter and distractions is essential. This will help kids get the b
Which sounds work best for my age group/target audience?
We quote because we are afraid to-change words, lest there be a change in meaning.
Quotations are easier to collect than writing things out in one's own words, not only because it requires no work, but we may be afraid of changing the original meaning by changing the original words or by collapsing the context and divorcing the words from their original environment.
Perhaps some may be afraid that the words sound "right" and they have a sense of understanding of them, but they don't quite have a full grasp of the situation. Of course this may be remedied by the reader or listener not only by putting heard stories into their own words and providing additional concrete illustrative examples of the concepts. These exercises are meant to ensure that one has properly heard/read and understood a concept. Psychologists call this paraphrasing or repetition the "echo effect" (others might say parroting or mirroring) and have found that it can help to build understanding, connection, and likeability between people. Great leaders who do this will be sure to make sure that credit for the original ideas goes to the originator and not to themselves simply because they repeated it, especially in group settings where their words may have more primacy amidst their underlings.
(I can't find it at the moment, but there's a name/tag for this in my notes? looping?)
Beyond this, can one place the idea into a more clear language than the original? Add some poetry perhaps? Make the concept into a concrete meme to make it more memorable?
Journalists like to quote because it gives primacy of voice to the speaker and provides the reader with the sense that they're getting the original from which they might make up their own minds. It also provides a veneer of vérité to their reportage.
Link this back to Terrence's comedy: https://hypothes.is/a/xe15ZKPGEe6NJkeL77Ji4Q
Description and illustration are^ comple-mentary, they give together a more complete picture than citherwithout the other.
Kaiser says that "description and illustration are complementary, they give together a more complete picture than either without the other" and this sentiment is similar to Mortimer J. Adler and Charles Van Doren's pedagogy of restatement and providing concrete examples a means of testing understanding.
See: - https://hypothes.is/a/RgUa-mOcEe6PChv_seYXZA - https://hypothes.is/a/B3sDhlm5Ee6wF0fRYO0OQg
u dans certains domaines (scolarité)
L’avis de la Défenseure des droits sur laproposition de loi « visant à lutter contreles discriminations par la pratique detests individuels et statistiques »
Every commit which is merged into main is considered a stable release. Every open PR is considered a beta release that I test locally.
The two HRV measures found to significantly predict lon-gevity were increases in rMSSD and HF.
it was the mothers that made all the difference he said it was mothers mobilizing around the world that stopped the testing of nuclear weapons in the atmosphere
It's also a good idea to run qualitative usability tests to understand how people are reacting to what they see.
logging in
Logging In
// Not recommended: log into the application like a user
// by typing into the form and clicking Submit
// While this works, it is slow and exercises the login form
// and NOT the feature you are trying to test.
// oftentimes once we have a proper e2e test around logging in
// there is NO more reason to actually use our UI to log in users
// doing so wastes a huge amount of time, as our entire page has to load
// all associated resources have to load, we have to wait to fill the
// form and for the form submission and redirection process
// instead of visiting each page and waiting for all
// the associated resources to load, we can instead
// just issue a simple HTTP request and make an
// assertion about the response body
cy.request('/admin')
.its('body')
.should('include', '<h1>Admin</h1>')
instead of cy.visit
Barzun, Jacques. “Opinion | Multiple Choice Flunks Out.” The New York Times, October 11, 1988, sec. Opinion. https://www.nytimes.com/1988/10/11/opinion/multiple-choice-flunks-out.html.
Archived copy at https://web.archive.org/web/20231022192353/https://www.nytimes.com/1988/10/11/opinion/multiple-choice-flunks-out.html. Internet Archive.
Barzun takes standardized multiple-choice tests to task.
A version of this article appears in Barzun's book: Barzun, Jacques. Begin Here: The Forgotten Conditions of Teaching and Learning. University of Chicago Press, 1991. http://archive.org/details/begin-here-the-forgotten-conditions-of-teaching-and-learning.
He pointed out that these questions penalize the more imaginative and favor those who are content to collect facts. Therefore, multiple-choice test statistics, in all their uses, are misleading.
He = Banesh Hoffman
This is tangentially similar to Malcolm Gladwell's claim that standardized testing for law school privileges certain types of thinkers over others, something which creates thinkers who are good at quick things with respect to time pressures rather than slower and more deliberate thinkers who are needed at higher level functions like the Supreme Court.
See: The Tortoise and the Hare, S4 E2 of Revisionist History https://www.pushkin.fm/podcasts/revisionist-history/the-tortoise-and-the-hare
testing imagination versus fact memorization/simple recall compared with thinking quickly under pressure or slowly with time and increased ability to reason
Thirty years ago, the physicist and teacher Banesh Hoffmann wrote a book, ''The Tyranny of Testing,'' which was attacked by the test-making industry and ignored by educationists. It showed how multiple-choice questions, by their form and substance, work against the aim of teaching.
Youmust apprehend the unity with definiteness. There is only oneway to know that you have succeeded. You must be able totell yourself or anybody else what the unity is, and in a fewwords. ( If it requires too many words, you have not seen theunity but a multiplicity. ) Do not be satisfied with "feeling theunity" that you cannot express. The reader who says, "I knowwhat it is, but I just can't say it," probably does not even foolhimself.
Adler/Van Doren use the statement of unity of a work as an example of testing one's understanding of a work and its contents.
(Again, did this exist in the 1940 edition?)
Who do McDaniel and Donnelly 1996 cite in their work as predecessors of their idea as certainly it existed?
Examples in the literature of this same idea/method after this: - https://hypothes.is/a/TclhyMfqEeyTkQdZl43ZyA (Feynman Technique in ZK; relationship to Ahrens) - explain it to me like I'm a 5th grader - https://hypothes.is/a/BKhfvuIyEeyZj_v7eMiYcg ("People talk" in Algebra Project) - https://hypothes.is/a/m0KQSDlZEeyYFLulG9z0vw (Intellectual Life version) - https://hypothes.is/a/OyAAflm5Ee6GStMjUMCKbw (earlier version of statement in this same work) - https://hypothes.is/a/iV5MwjivEe23zyebtBagfw (Ahrens' version of elaboration citing McDaniel and Donnelly 1996, this uses both restatement and application to a situation as a means of testing understanding) - https://hypothes.is/a/B3sDhlm5Ee6wF0fRYO0OQg (Adler's version for testing understanding from his video) - https://hypothes.is/a/rh1M5vdEEeut4pOOF7OYNA (Manfred Kuenh and Luhmann's reformulating writing)
https://www.reddit.com/r/Zettelkasten/comments/10jx7gg/wooden_antinet_zettelkasten/
Scott Scheper commissioned a two drawer solid wood (cedar) zettelkasten box similar to those from the early 20th century. He had it listed on his website initially for $995 and then later for a reduced $495.
He created a waitlist sign up for it, ostensibly to test the interest in manufacturing/selling them as a product. To my knowledge he never made any beyond the initial prototype.
The high cost likely dampened interest compared to the much cheaper primary and secondary markets for these sorts of storage containers.
See also:<br /> - $995 https://web.archive.org/web/20230124062200/https://www.antinet.org/wooden-antinet-waitlist - $495 reduction https://web.archive.org/web/20230306195625/https://www.antinet.org/wooden-antinet-waitlist
Example annotation.
Adaptive Stress Testing with Reward Augmentation for Autonomous Vehicle Validation
Microsoft strongly recommends that you conduct email testing prior to sending live communications to your users/customers
ime to start annotating
Testing Hypothes.is extension
I ran into the same problem and never really found a good answer via the test objects. The only solution I saw was to actually update the session via a controller. I defined a new action in one of my controllers from within test_helper (so the action does not exist when actually runnning the application). I also had to create an entry in routes. Maybe there’s a better way to update routes while testing. So from my integration test I can do the following and verfiy: assert(session[:fake].nil?, “starts empty”) v = ‘Yuck’ get ‘/user_session’, :fake => v assert_equal(v, session[:fake], “value was set”)
we should have some psychological screening at the top jobs. I think that there should be an expectation that people who are about to control nuclear weapons, that can literally wipe out our species, should, at a minimum, be subject to a psychological test.
testing
REPLs are nice but they work well only for reasonably isolated code with few dependencies. It's hard to set up a complex object to pass into a function. It's harder still to set up an elaborate context of dependencies around that function.
I wonder how much of this is accomplishable by automatically parameterizing code by the types that aren't used internally so they implementation can forget about the specifics. In addition some sort of meta-programming capability to automatically generate arbitrary instances or a richer form of trace types for user types would go a long way to simplifying the trace generation.
create a new instance
Will we be able to mock this in tests?
Industrial concerns doubtless suffer enormous losses from the employment of persons whose mental ability is not equal to the tasks they are expected to perform. The present methods of trying out new employees, transferring them to simpler and simpler jobs as their inefficiency becomes apparent, is wasteful and to a great extent unnecessary. A cheaper and more satisfactory method would be to employ a psychologist to examine applicants for positions and to weed out the unfit. Any business employing as many as five hundred or a thousand workers, as, for example, a large department store, could save in this way several times the salary of a well-trained psychologist.
I think this is interesting because they are saying that intelligence testing could be used to determine job positions. I agree that employing a psychologist to examine applications for positions would be beneficial because the employer doesn't have to worry about certain things the psychologist would look for. I agree that using a psychologist to weed people out of decision of employment could be effective because many people are applying, but the employers only want certain people for that job. I think this is relevant to the history of psychology because there are some companies who use people to determine who is deemed fit for the company, and this is what they wanted to start doing so they could find the best employees for that particular job.
Nonparametric statistics are often preferred to parametric tests when the sample size is small and the data are skewed or contain outliers.
when to use parametric versus nonparametric tests [8,9,10]
Our test raises an ActiveRecord::RecordNotFound exception. We know that Rails has special handling to return a 404 status code in this case. However, the request spec still raises the exception.
https://www.antinet.org/wooden-antinet-waitlist
2023-03-06: Noting that the list price on this has now dropped to $495 including shipping. He's also closed the wait list, which I'm guessing was set up to both collect email addresses as well as to test market the demand for such a box at his various price points.
What we ultimately should care about is being able to use our knowledge to produce something new, whatever that may be. To not merely reproduce you must understand the material. And understanding requires application, a hermeneutic principle that particularly Gadamer worked out extensively. If you really want to measure your level of understanding, you should try to apply or explain something to yourself or someone else.
Animal Communication Pomona Spring 2023
testing
I've used this briefly before, but it's also something Scott Scheper swears by.
alternative: https://optimize.google.com/
I've seen a bunch of people sharing this and repeating the conclusion: that the success is because the CEO loves books t/f you need passionate leaders and... while I think that's true, I don't think that's the conclusion to draw here. The winning strategy wasn't love, it was delegation and local, on the ground, knowledge.
This win comes from a leader who acknowledges people in the stores know their communities and can see and react faster to sales trends in store... <br /> —Aram Zucker-Scharff (@Chronotope@indieweb.social) https://indieweb.social/@Chronotope/109597430733908319 Dec 29, 2022, 06:27 · Mastodon for Android
Also heavily at play here in their decentralization of control is regression toward the mean (Galton, 1886) by spreading out buying decisions over a more diverse group which is more likely to reflect the buying population than one or two corporate buyers whose individual bad decisions can destroy a company.
How is one to balance these sorts of decisions at the center of a company? What role do examples of tastemakers and creatives have in spaces like fashion for this? How about the control exerted by Steve Jobs at Apple in shaping the purchasing decisions of the users vis-a-vis auteur theory? (Or more broadly, how does one retain the idea of a central vision or voice with the creative or business inputs of dozens, hundreds, or thousands of others?)
How can you balance the regression to the mean with potentially cutting edge internal ideas which may give the company a more competitive edge versus the mean?
Let’s say, you’ve set up the list-unsubscribe header to you email campaign. Check out whether it works by sending an email to a test inbox. For Gmail, you’ll see the unsubscribe option in the header right next to the sender name
I've developed additional perspective on this issue - I have DNS settings in my hosts file that are what resolve the visits to localhost, but also preserve the subdomain in the request (this latter point is important because Rails path helpers care which subdomain is being requested) To sum up the scope of the problem as it stands now - I need a way within Heroku/Capybara system tests to both route requests to localhost, but also maintain the subdomain information of the request. I've been able to accomplish one or the other, but haven't found a configuration that provides both yet.
Honestly, at this point, I don't even know what tools I'm using, and which is responsible for what feature. Diving into the code of capybara and cucumber yields hundreds of lines of metaprogramming magic that somehow accretes into a testing framework. It's really making me loathe TDD despite my previous youthful enthusiasm.
opinion: too much metaprogramming magic
I'm not so sure it's "too much" though... Any framework or large software project is going to feel that way to a newcomer looking at the code, due to the number of layers of abstractions, etc. that eventually were added/needed by the maintainers to make it maintainable, decoupled, etc.
session = ActionDispatch::Integration::Session.new(Rails.application) response = session.post("/mypath", my_params: "go_here")
worked for me
As has been stated elsewhere, in a Capybara test you typically want to do POSTs by submitting a form just like the user would.
module InjectSession include Warden::Test::Helpers def inject_session(hash) Warden.on_next_request do |proxy| hash.each do |key, value| proxy.raw_session[key] = value end end end end
种语言词汇的
testinggggg for second page
testing for page note
Video on Functional Core, Imperative Shell paradigm. Recommended in Hypothes.is testing documentation
the functional core, imperative shell pattern
Link to video on "Boundaries" doesn't go into depth on the functional core, imperative shell pattern. However, this one does: https://www.destroyallsoftware.com/screencasts/catalog/functional-core-imperative-shell
For new code, it’s usually a good idea to design the code so that it’s easy to test with “real” objects, rather than stubs or mocks.
We keep our functional tests separate from our unit tests, in the tests/functional directory. Because these are slow to run, we will usually write one or two functional tests to check a new feature works in the common case, and unit tests for all the other cases.
Keep functional & unit tests separate. Functional for common cases, unit for all others.
To run the backend test suite only call tox directly
Probably means, "Call tox directly if you only want to run the backend test suite."
How to Turn Obsidian Into a Personal Kanban Organizer
Checking if Hypothesis works on Android via Brave mobile browser. Here's some LaTex being tested \(A \cup B\)
strange images with junk and light
another test annotation
create something weird
testing hypothesis annotations
That is called profiling, not performance testing. Performance testing should ensure that a piece of code runs within a desired amount of time, given a certain context, before the new code goes into production.
Pisani, J. (2021, July 9). Amazon to end testing for COVID-19 at warehouses this month. King5.Com. https://www.king5.com/article/news/health/coronavirus/amazon-to-stop-covid-testing/507-63caa9ee-3a30-49af-b2cd-1851637f1807
Beaumont, P. (2021, September 16). Which countries are enforcing mandatory Covid jabs – and how? The Guardian. https://www.theguardian.com/world/2021/sep/16/which-countries-enforcing-mandatory-covid-vaccination
De Block Golding, D. (2021, April 7). Viral video contains several false pandemic claims. Full Fact. https://fullfact.org/health/viral-video-contains-several-false-pandemic-claims/
Wise, J. (2021). Winter is coming—But will the gloomiest forecasts come to pass? BMJ, 374, n2357. https://doi.org/10.1136/bmj.n2357
Bays, D., Whiteley, T., Pindar, M., Taylor, J., Walker, B., Williams, H., Finnie, T. J. R., & Gent, N. (2021). Mitigating isolation: The use of rapid antigen testing to reduce the impact of self-isolation periods (p. 2021.12.23.21268326). medRxiv. https://doi.org/10.1101/2021.12.23.21268326
Lenhardt, S. (2021, November 30). Labore am Limit: Bei PCR-Tests kommt es zu Engpässen. tagesschau.de. https://www.tagesschau.de/wirtschaft/labore-corona-pcr-test-101.html
Barnes, O., & Payne, S. (2021, October 15). Covid testing suspended at private UK lab over potentially inaccurate results. Financial Times. https://www.ft.com/content/bee7bdde-e37b-4411-afd5-1e8ceacfa5f8
Koerth, M. (2021, November 3). The Science You Need To Make Your COVID-19 Decisions. FiveThirtyEight. https://projects.fivethirtyeight.com/covid-19-updates/
Chao-Fong, L. (2021, July 20). Lateral flow Covid tests run out on day lockdown is fully lifted | The Independent. https://www.independent.co.uk/news/uk/home-news/lateral-flow-test-run-out-b1886589.html
Klein, B., Generous, N., Chinazzi, M., Bhadricha, Z., Gunashekar, R., Kori, P., Li, B., McCabe, S., Green, J., Lazer, D., Marsicano, C. R., Scarpino, S. V., & Vespignani, A. (2021). Higher education responses to COVID-19 in the United States: Evidence for the impacts of university policy (p. 2021.10.07.21264419). https://doi.org/10.1101/2021.10.07.21264419
Weiss, D. J., & Shanteau, J. (2021). The futility of decision making research. Studies in History and Philosophy of Science Part A, 90, 10–14. https://doi.org/10.1016/j.shpsa.2021.08.018
ReconfigBehSci. (2021, November 26). RT @mvankerkhove: Following the meeting of the @WHO TAG-VE today, WHO classifies B.1.1.529 as a variant of concern named Omicron. We call… [Tweet]. @SciBeh. https://twitter.com/SciBeh/status/1464302372100976650
Kissane, E. (2021, December 23). We’re About to Lose Track of the Pandemic. The Atlantic. https://www.theatlantic.com/ideas/archive/2021/12/were-about-to-lose-track-of-the-pandemic/621097/
ReconfigBehSci. (2021, November 26). Parts of Germany seem to have potentially introduced requirements that cannot practically be met as testing capacity is proving insufficient—A dangerous moment for rule compliance Nadelöhr Corona-Tests: “Es ist Wahnsinn” via @sz https://t.co/meLS79RTCw [Tweet]. @SciBeh. https://twitter.com/SciBeh/status/1464287412289511432
ReconfigBehSci. (2021, December 7). RT @ravgup33_ravi: Now that there is evidence for community transmission in the U.K. and we have tighter testing for travellers we should r… [Tweet]. @SciBeh. https://twitter.com/SciBeh/status/1468135947795578884
one should not upgrade a production environment without extensive testing. I prefer to not upgrade prod at all. Instead, I create a new instance with latest everything, host my apps there, test everything out, and then make it production.
It's a great way to test various limits. When you think about this even more, it's a little mind-bending, as we're trying to impose a global clock ("who is the most up to date") on a system that inherently doesn't have a global clock. When we scale time down to nanoseconds, this affects us in the real world of today: a light-nanosecond is not very far.
As this stands, the specs could pass w/o the formatter.output == new_formatter.output check.
Oh I see whats happening, we actually have specs for this but they're not correct
Interestingly, Rails doesn't see this in their test suite because they set this value during setup:
It really slows down your test suite accessing the disk.So yes, in principle it slows down your tests. There is a "school of testing" where developer should isolate the layer responsible for retrieving state and just set some state in memory and test functionality (as if Repository pattern). The thing is Rails is a tightly coupled with implementation logic of state retrieval on core level and prefers "school of testing" in which you couple logic with state retrial to some degree.Good example of this is how models are tested in Rails. You could just build entire test suite calling `FactoryBot.build` and never ever use `FactoryBot.create` and stub method all around and your tests will be lighting fast (like 5s to run your entire test suite). This is highly unproductive to achieve and I failed many times trying to achieve that because I was spending more time maintaining my tests then writing something productive for business.Or you can took more pragmatic route and save database record where is too difficult to just 'build' the factory (e.g. Controller tests, association tests etc)Same I would say for saving the file to the Disk. Yes you are right You could just "not save the file to disk" and save few milliseconds. But at the same time you will in future stumble upon scenarios where your tests are not passing because the file is not there (e.g. file processing validations) Is it really worth it ? I never worked on a project where saving file to a disk would slow down tests significantly enough that would be an issue (and I work for company where core business is related to file uploading) Especially now that we have SSD drives in every laptop/server it's blazing fast so at best you would save 1 seconds for entire test suite (given you call FactoryBot traits to set/store file where it make sense. Not when every time you build an object.)
Điểm đo ở xã Hải Bối, huyện Đông Anh ghi nhận lượng mưa lớn với gần 250 mm. Tại nội thành, quận Bắc Từ Liêm mưa lớn nhất 240 mm, Cầu Giấy gần 140 mm, các quận Nam Từ Liêm, Bắc Từ Liêm, Hà Đông trên 100 mm.
Đoạn này là sao ấy nhỉ
Black Box testing: Software on the rack
Black Box testing: Software on the rack
Black Box testing is defined as a testing technique in which the functionality of an application is tested without looking at the internal code structure, implementation details and knowledge of internal paths of the software. This type of testing is completely based on software requirements and specifications.

When a few of his friends became interested in thetopic, he took eight minutes to progressively summarize the bestexcerpts before sharing the summarized article with them. The timethat he had spent reading and understanding a complex subject paidoff in time savings for his friends, while also giving them a newinterest to connect over.
To test one's own understanding of a topic one has read about and studied, it can be useful to discuss it or describe one's understanding to friends or colleagues in conversations. This will help you discover where the holes are based on the person's understanding and comprehension of what you've said. Can you fill in all the holes where they have questions? Are their questions your new questions which have exposed holes that need to be filled in your understanding or in the space itself.
I do this regularly in conversations with people. It makes the topics of conversation more varied and interesting and helps out your thinking at the same time. In particular I've been doing this method in Dan Allosso's book club. It's almost like trying on a new idea the way one might try on a piece of clothing to see how it fits or how one likes it for potential purchase. If an idea "fits" then continue refining it and add it to your knowledge base. These conversations also help to better link ideas in my thought space to those of what we're reading. (I wonder if others are doing these same patterns, Dan seems to, but I don't have as good a grasp on this with other participants).
Link to :<br /> - Ahren's idea of writing to expose understanding<br /> - Feynman technique<br /> - Socratic method (this is sort of side or tangential method to this) <- define this better/refine
It is important that you achieve optimal test results with software testing without deviating from the test goal. But how do you determine whether you are following the right test strategy? For this you have to follow a number of basic principles.

group.run(double.as_null_object)
What does this actually do/mean?
Some Test Scenarios: * Single or multi-browser testing * Downloading an app to the desktop * Installing browser extensions
IHME | COVID-19 Projections. (n.d.). Institute for Health Metrics and Evaluation. Retrieved April 21, 2022, from https://covid19.healthdata.org/
Individuals who harbor germline pathogenic variants in DICER1 (MIM #601200) have an increased risk for a variety of benign and malignant tumors.
GENENAME: DICER1 PMID: 31952842 HGNCID: NONE DISEASE ENTITY: THYROID DISEASE AND OVARIAN TUMORS ZYGOSITY: HETEROZYGOUS VARIANT: NONE CASE PRESENTING HPOS: cervical embryonal rhabdomyosarcoma CASE PREVIOUS TESTING: GYNECOLOGIC HISTORY MUTATION: DELETION
The issue that had roiled the grown-ups in his life seemed to have had no effect on him at all.
I wonder if that suggests the tests matter less, or more?
A 95 percent opt-out rate was a resounding success. It rivaled election results in Turkmenistan.
Wow. That's an incredibly long reach to make an incendiary point.
ReconfigBehSci [@SciBeh]. (2021, November 28). RT @CiesekSandra: Servicetweet für Labore: #Omicron wird auch von 3 PCR Systemen in den angegebenen Genen detektiert. Https://t.co/x3gZEP2r… [Tweet]. Twitter. https://twitter.com/SciBeh/status/1464991380628021254
Update on Omicron. (n.d.). Retrieved April 22, 2022, from https://www.who.int/news/item/28-11-2021-update-on-omicron
ReconfigBehSci [@SciBeh]. (2021, December 10). RT @YouAreLobbyLud: “Let’s aim to be like Denmark in 2022.” https://t.co/807jIVHdpt [Tweet]. Twitter. https://twitter.com/SciBeh/status/1469272379134201859
Ashish K. Jha, MD, MPH. (2020, October 27). President keeps saying we have more cases because we are testing more This is not true But wait, how do we know? Doesn’t more testing lead to identifying more cases? Actually, it does So we look at other data to know if its just about testing or underlying infections Thread [Tweet]. @ashishkjha. https://twitter.com/ashishkjha/status/1321118890513080322
Dr Nisreen Alwan 🌻. (2021, March 14). Exactly a year ago we wrote this letter in the Times. We were gobsmacked! We just didn’t understand what the government was basing all its decisions on including stopping testing and the herd immunity by natural infection stuff. We wanted to see the evidence backing them. [Tweet]. @Dr2NisreenAlwan. https://twitter.com/Dr2NisreenAlwan/status/1371168531669258242
Denise Dewald, MD 🗽. (2021, August 12). Here are some modeling predictions for the delta variant from COVSIM (group at North Carolina State): PLEASE CHECK THIS OUT - RESOURCES TO SHARE WITH YOUR SCHOOL DISTRICT School-level COVID-19 Modeling Results for North Carolina for #DeltaVariant https://t.co/zU5hB9bKlY [Tweet]. @denise_dewald. https://twitter.com/denise_dewald/status/1425626289399009288
Coronavirus (COVID-19) Infection Survey, UK: 29 October 2021, Office for National Statistics
When setting up SAML SSO in your organization, you can test your implementation without affecting your organization members by leaving Require SAML SSO authentication for all members of the organization name organization unchecked.
Manual testing is a type of software test in which testers manually carry out test cases without using automation tools. Testers are actually behind the screen of the application, carry out test cases and see what the result is.
Manual testing is a type of software test in which testers manually carry out test cases without using automation tools. Testers are actually behind the screen of the application, carry out test cases and see what the result is.

Carl T. Bergstrom on Twitter. (n.d.). Twitter. Retrieved April 1, 2022, from https://twitter.com/CT_Bergstrom/status/1479938695054594050
A test case is a series of actions that are performed to determine a specific function or functionality of your application. Test scenarios are rather vague and include a wide range of variables. However, testing is all about being very specific. That is why we need elaborate test cases.
Test cases, examples and Best Practices
A test case is a series of actions that are performed to determine a specific function or functionality of your application. Test scenarios are rather vague and include a wide range of variables. However, testing is all about being very specific. That is why we need elaborate test cases.

ReconfigBehSci on Twitter: ‘RT @karamballes: What is the Governments vision of “endemic”? #r4today https://t.co/1VZDJac2e7’ / Twitter. (n.d.). Retrieved 29 March 2022, from https://twitter.com/SciBeh/status/1481903227109056518
ReconfigBehSci. (2022, January 5). RT @thehowie: "...Rapid antigen tests may not be as fit-for-purpose in routine workplace screening to prevent asymptomatic spread of Omicro… [Tweet]. @SciBeh. https://twitter.com/SciBeh/status/1478796550004486151
Leo Poon. (2022, January 15). @MackayIM @Clin_Chem_AACC HK has another lay of swiss cheese. Testing waste water to identify COVID positive buildings, following by mandatory testing on occupants. It can find some silent spreaders. Https://t.co/2wu6QG6Db1 [Tweet]. @world_epidemic. https://twitter.com/world_epidemic/status/1482189879010217986
Capybara can get us part of the way there. It allows us to work with an API rather than manipulating the HTML directly, but what it provides isn't an application specific API. It gives us low-level API methods like find, fill_in, and click_button, but it doesn't provide us with high-level methods to do things like "sign in to the app" or "click the Dashboard item in the navigation bar".
A page object is a data structure that provides an interface to your web application for the purposes of test automation. For example, it could represent a single HTML page, or perhaps even a fragment of HTML on a page.
A page object wraps an HTML page, or fragment, with an application-specific API, allowing you to manipulate page elements without digging around in the HTML.
this gem promotes writing tests for data migrations providing a way allows to write code that migrates data in separate methods.
having the code migrates data separately covered by proper tests eliminates those pesky situations with outdated migrations or corrupted data.
Covid testing ‘overestimate’ grounds for concern. (2022, March 2). Newsroom. https://www.newsroom.co.nz/page/covid-testing-overestimate-grounds-for-concern
Unit, Corona Behavioural. ‘The Effect of Proximity of COVID-19 Test Facilities on Test Uptake: Two Quasi-Experimental Trials’. PsyArXiv, 17 February 2022. https://doi.org/10.31234/osf.io/rhvmc.
STLC - Software Testing Life Cycle
Software Testing Life Cycle (STLC) is defined as a set of activities performed to perform software testing. The Software Testing Life Cycle refers to a testing process with specific steps that must be performed in a specific order to ensure that quality objectives are met.

Mus, P. D. (n.d.). What Is the Difference between Pandemic and Endemic? Boston University. Retrieved February 21, 2022, from https://www.bu.edu/articles/2022/difference-between-pandemic-and-endemic/
Boris Johnson: Do not throw caution to the wind on Covid. (2022, February 20). BBC News. https://www.bbc.com/news/uk-60446908
Bill Hanage. (2022, January 7). So a brief thread on the state of the pandemic in the Boston area. It is quite plausible that about 10% of the population is currently infected, more in some age groups than others. A lot of omicron. What next? 1/n https://t.co/aXgc5GYk0V [Tweet]. @BillHanage. https://twitter.com/BillHanage/status/1479574053807697920
Nerd, G. M.-K. H. (2021, December 29). Ivermectin Didn’t Save Uttar Pradesh From Covid-19. Elemental. https://gidmk.medium.com/ivermectin-didnt-save-uttar-pradesh-from-covid-19-17684f49d8b3
Health sector facing ‘challenges not seen in a century’ as Victoria issues rare Code Brown. (2022, January 17). ABC News. https://www.abc.net.au/news/2022-01-18/victoria-records-more-covid-19-deaths-hospitalisations-and-cases/100762978
Academics: Viral Evolution Scenarios, 10 February 2022. (n.d.). GOV.UK. Retrieved February 14, 2022, from https://www.gov.uk/government/publications/academics-viral-evolution-scenarios-10-february-2022
Because CENS was an academic research lab, faculty members held a large amount of power to decide which projects students pursued and what issues students faced during design, testing, and implem
CENS seems like it takes its job seriously. Like I said in my other annotation for week 5. Just because data scientists are trying to root out bias in all forms doesn't mean it is always effective or that what is effective can't be improved.
Yaniv Erlich on Twitter. (n.d.). Twitter. Retrieved February 8, 2022, from https://twitter.com/erlichya/status/1482847821397176325
Carl T. Bergstrom. (2022, January 8). 1. A thread on the CDC’s recommendation to de-isolate five days after the onset of symptoms or positive test. First, a disclosure: I was paid as a consultant for this work, done in collaboration with @Color Health, which provides COVID testing services and vaccination logistics. [Tweet]. @CT_Bergstrom. https://twitter.com/CT_Bergstrom/status/1479938654579544065
Major COVID course correction immediately required. (n.d.). Retrieved February 7, 2022, from https://ozsage.org/media_releases/major-covid-course-correction-immediately-required/
(((Howard Forman))). (2022, January 31). Connecticut Cases down 72% from last week (lower test resulting). Positive rate 7.3%, lowest since 12/20. Hospitalizations down 29%. 46% are fully vaccinated. FWIW - my hospital is 67% down from peak census. Good news! Https://t.co/dOpFO2fjTK [Tweet]. @thehowie. https://twitter.com/thehowie/status/1488277435342901259
Camero, K. (n.d.). If You Think You Have COVID But Your Rapid Test Is Negative, Here’s Why. BuzzFeed News. Retrieved February 4, 2022, from https://www.buzzfeednews.com/article/katiecamero/negative-covid-test
Mahan Ghafari, ماهان غفاری. (2022, February 1). After the daily covid-19 cases in Iran dropped to record-low numbers last month, it is now back in full swing due to omicron! Map on the left (mostly coloured in blue) shows the situation in late december and the one on the right is from 2 days ago (many in amber or red). 1/ https://t.co/vgpwuiymbl [Tweet]. @Mahan_Ghafari. https://twitter.com/Mahan_Ghafari/status/1488481042847698946
Queensland schools to resume with mask mandate, as 13 more COVID-19 deaths recorded. (2022, January 30). ABC News. https://www.abc.net.au/news/2022-01-30/qld-coronavirus-covid19-13-deaths/100783416
Torjesen, I. (2022). Covid-19: Peak of viral shedding is later with omicron variant, Japanese data suggest. BMJ, 376, o89. https://doi.org/10.1136/bmj.o89
Michael Wagner. (2022, January 26). Omicron wave and its consequences.... Https://t.co/jNCYe2pa6k [Tweet]. @MichiWagner4. https://twitter.com/MichiWagner4/status/1486400536635129859
Que faire si vous ne recevez pas de réponse écrite ?Consultez l'accusé réception de votre demande.L'AR indique la date à laquelle, en l'absence de réponse écrite, votre demande est considérée comme acceptée ou refusée.Répondez aux questions successives et les réponses s’afficheront automatiquementSi l'AR indique que la demande est acceptée en l'absence de réponse écriteIl s'agit d'une décision implicite d'acceptation : Lorsque l'administration ne répond pas à une demande, le silence signifie que la demande est acceptée..L'AR indique que vous pouvez demander une attestation à l'administration.Si l'AR indique que la demande est refusée en l'absence de réponse écriteIl s'agit d'une décision implicite de rejet : Lorsque l'administration ne répond pas à un recours gracieux ou hiérarchique, le silence est assimilé à un rejet ..L'AR indique comment contester ce refus (voies et délais de recours). Choisir votre cas Si l'AR indique que la demande est acceptée en l'absence de réponse écrite Si l'AR indique que la demande est refusée en l'absence de réponse écrite Si l'AR indique que la demande est acceptée en l'absence de réponse écriteIl s'agit d'une décision implicite d'acceptation: titleContent.L'AR indique que vous pouvez demander une attestation à l'administration.Si l'AR indique que la demande est refusée en l'absence de réponse écriteIl s'agit d'une décision implicite de rejet: titleContent.L'AR indique comment contester ce refus (voies et délais de recours). À noter : sauf exceptions, si vous ne recevez pas de réponse au bout de 2 mois, cela signifie que votre demande est acceptée. C'est ce qu'on appelle la règle du silence vaut acceptation (SVA).
Par ailleurs, des enquêtes ou testings spécifiques doivent être menés dans les secteurs et/ou contextes où les jeunes sont les plus exposés aux discriminations, comme l’intérim, les stages, l’apprentissage ou les emplois avec des contrats courts1