Can I use rates (e.g., crude rates) as an explanatory variable (covariate) in regression?





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty{ margin-bottom:0;
}






up vote
2
down vote

favorite
2












I am working on formulating a regression problem where my dependent variable is the number of heart disease cases (count data) in a population. I am looking to do an ecological poisson-based regression.



I have crude rate per 100,000 population for smoking, where:



crude rate = (total number of smokers / total number of persons in the population) x 100,000



I would like to know if it is possible to directly use ' crude rates' as an independent, explanatory variable in my regression problem. Generally speaking, I know in regression problems 'percentages' can be used, but not sure about 'rates'. Any insights are welcome.










share|cite|improve this question




























    up vote
    2
    down vote

    favorite
    2












    I am working on formulating a regression problem where my dependent variable is the number of heart disease cases (count data) in a population. I am looking to do an ecological poisson-based regression.



    I have crude rate per 100,000 population for smoking, where:



    crude rate = (total number of smokers / total number of persons in the population) x 100,000



    I would like to know if it is possible to directly use ' crude rates' as an independent, explanatory variable in my regression problem. Generally speaking, I know in regression problems 'percentages' can be used, but not sure about 'rates'. Any insights are welcome.










    share|cite|improve this question
























      up vote
      2
      down vote

      favorite
      2









      up vote
      2
      down vote

      favorite
      2






      2





      I am working on formulating a regression problem where my dependent variable is the number of heart disease cases (count data) in a population. I am looking to do an ecological poisson-based regression.



      I have crude rate per 100,000 population for smoking, where:



      crude rate = (total number of smokers / total number of persons in the population) x 100,000



      I would like to know if it is possible to directly use ' crude rates' as an independent, explanatory variable in my regression problem. Generally speaking, I know in regression problems 'percentages' can be used, but not sure about 'rates'. Any insights are welcome.










      share|cite|improve this question













      I am working on formulating a regression problem where my dependent variable is the number of heart disease cases (count data) in a population. I am looking to do an ecological poisson-based regression.



      I have crude rate per 100,000 population for smoking, where:



      crude rate = (total number of smokers / total number of persons in the population) x 100,000



      I would like to know if it is possible to directly use ' crude rates' as an independent, explanatory variable in my regression problem. Generally speaking, I know in regression problems 'percentages' can be used, but not sure about 'rates'. Any insights are welcome.







      regression poisson-regression epidemiology






      share|cite|improve this question













      share|cite|improve this question











      share|cite|improve this question




      share|cite|improve this question










      asked Nov 10 at 9:15









      user121

      13511




      13511






















          2 Answers
          2






          active

          oldest

          votes

















          up vote
          2
          down vote



          accepted










          According to this source, "Rates are a special type of ratio that incorporate the dimension of time into the denominator".



          "For example:
          A mortality rate is the proportion of deaths occurring over a span of time in a population. "



          "Strictly speaking, these are all proportions, but incidence rates or incidence density is a measurement of the frequency of a health outcome that is more like a true rate. ...Similar to computation of an average speed for an automobile, an incidence rate is computed by dividing the total number of health-related events that occurred by the total exposure time at risk for the group."



          Below is an illustration of incidence rate taken from the above source. Here, the average rate at which the outcome occurred was 3/ 100 person-years of observed exposure time. That average rate can be expressed as a proportion.



          Taking that into account, I fail to see why you cannot use a rate as covariate in a regression model, provided that the covariate is a meaningful predictor of your outcome, and that other issues such as potential attrition bias and endogeneity have been addressed.



          For the case where the rate or proportion is the outcome variable, then I would recommend beta-regression.



          So, my answer is yes.



          enter image description here






          share|cite|improve this answer






























            up vote
            1
            down vote













            Yes, but you might want to apply feature scaling to the crude rate (other features as well) before using it to train your Poisson regression model.






            share|cite|improve this answer

















            • 1




              Can you please explain why you think feature scaling is important? Without explanation this is more of a comment.
              – kjetil b halvorsen
              Nov 10 at 13:36











            Your Answer





            StackExchange.ifUsing("editor", function () {
            return StackExchange.using("mathjaxEditing", function () {
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            });
            });
            }, "mathjax-editing");

            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "65"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            convertImagesToLinks: false,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














             

            draft saved


            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f376302%2fcan-i-use-rates-e-g-crude-rates-as-an-explanatory-variable-covariate-in-re%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            2 Answers
            2






            active

            oldest

            votes








            2 Answers
            2






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes








            up vote
            2
            down vote



            accepted










            According to this source, "Rates are a special type of ratio that incorporate the dimension of time into the denominator".



            "For example:
            A mortality rate is the proportion of deaths occurring over a span of time in a population. "



            "Strictly speaking, these are all proportions, but incidence rates or incidence density is a measurement of the frequency of a health outcome that is more like a true rate. ...Similar to computation of an average speed for an automobile, an incidence rate is computed by dividing the total number of health-related events that occurred by the total exposure time at risk for the group."



            Below is an illustration of incidence rate taken from the above source. Here, the average rate at which the outcome occurred was 3/ 100 person-years of observed exposure time. That average rate can be expressed as a proportion.



            Taking that into account, I fail to see why you cannot use a rate as covariate in a regression model, provided that the covariate is a meaningful predictor of your outcome, and that other issues such as potential attrition bias and endogeneity have been addressed.



            For the case where the rate or proportion is the outcome variable, then I would recommend beta-regression.



            So, my answer is yes.



            enter image description here






            share|cite|improve this answer



























              up vote
              2
              down vote



              accepted










              According to this source, "Rates are a special type of ratio that incorporate the dimension of time into the denominator".



              "For example:
              A mortality rate is the proportion of deaths occurring over a span of time in a population. "



              "Strictly speaking, these are all proportions, but incidence rates or incidence density is a measurement of the frequency of a health outcome that is more like a true rate. ...Similar to computation of an average speed for an automobile, an incidence rate is computed by dividing the total number of health-related events that occurred by the total exposure time at risk for the group."



              Below is an illustration of incidence rate taken from the above source. Here, the average rate at which the outcome occurred was 3/ 100 person-years of observed exposure time. That average rate can be expressed as a proportion.



              Taking that into account, I fail to see why you cannot use a rate as covariate in a regression model, provided that the covariate is a meaningful predictor of your outcome, and that other issues such as potential attrition bias and endogeneity have been addressed.



              For the case where the rate or proportion is the outcome variable, then I would recommend beta-regression.



              So, my answer is yes.



              enter image description here






              share|cite|improve this answer

























                up vote
                2
                down vote



                accepted







                up vote
                2
                down vote



                accepted






                According to this source, "Rates are a special type of ratio that incorporate the dimension of time into the denominator".



                "For example:
                A mortality rate is the proportion of deaths occurring over a span of time in a population. "



                "Strictly speaking, these are all proportions, but incidence rates or incidence density is a measurement of the frequency of a health outcome that is more like a true rate. ...Similar to computation of an average speed for an automobile, an incidence rate is computed by dividing the total number of health-related events that occurred by the total exposure time at risk for the group."



                Below is an illustration of incidence rate taken from the above source. Here, the average rate at which the outcome occurred was 3/ 100 person-years of observed exposure time. That average rate can be expressed as a proportion.



                Taking that into account, I fail to see why you cannot use a rate as covariate in a regression model, provided that the covariate is a meaningful predictor of your outcome, and that other issues such as potential attrition bias and endogeneity have been addressed.



                For the case where the rate or proportion is the outcome variable, then I would recommend beta-regression.



                So, my answer is yes.



                enter image description here






                share|cite|improve this answer














                According to this source, "Rates are a special type of ratio that incorporate the dimension of time into the denominator".



                "For example:
                A mortality rate is the proportion of deaths occurring over a span of time in a population. "



                "Strictly speaking, these are all proportions, but incidence rates or incidence density is a measurement of the frequency of a health outcome that is more like a true rate. ...Similar to computation of an average speed for an automobile, an incidence rate is computed by dividing the total number of health-related events that occurred by the total exposure time at risk for the group."



                Below is an illustration of incidence rate taken from the above source. Here, the average rate at which the outcome occurred was 3/ 100 person-years of observed exposure time. That average rate can be expressed as a proportion.



                Taking that into account, I fail to see why you cannot use a rate as covariate in a regression model, provided that the covariate is a meaningful predictor of your outcome, and that other issues such as potential attrition bias and endogeneity have been addressed.



                For the case where the rate or proportion is the outcome variable, then I would recommend beta-regression.



                So, my answer is yes.



                enter image description here







                share|cite|improve this answer














                share|cite|improve this answer



                share|cite|improve this answer








                edited Nov 10 at 11:21

























                answered Nov 10 at 9:58









                Krantz

                759




                759
























                    up vote
                    1
                    down vote













                    Yes, but you might want to apply feature scaling to the crude rate (other features as well) before using it to train your Poisson regression model.






                    share|cite|improve this answer

















                    • 1




                      Can you please explain why you think feature scaling is important? Without explanation this is more of a comment.
                      – kjetil b halvorsen
                      Nov 10 at 13:36















                    up vote
                    1
                    down vote













                    Yes, but you might want to apply feature scaling to the crude rate (other features as well) before using it to train your Poisson regression model.






                    share|cite|improve this answer

















                    • 1




                      Can you please explain why you think feature scaling is important? Without explanation this is more of a comment.
                      – kjetil b halvorsen
                      Nov 10 at 13:36













                    up vote
                    1
                    down vote










                    up vote
                    1
                    down vote









                    Yes, but you might want to apply feature scaling to the crude rate (other features as well) before using it to train your Poisson regression model.






                    share|cite|improve this answer












                    Yes, but you might want to apply feature scaling to the crude rate (other features as well) before using it to train your Poisson regression model.







                    share|cite|improve this answer












                    share|cite|improve this answer



                    share|cite|improve this answer










                    answered Nov 10 at 9:53









                    sPaceVodka

                    392




                    392








                    • 1




                      Can you please explain why you think feature scaling is important? Without explanation this is more of a comment.
                      – kjetil b halvorsen
                      Nov 10 at 13:36














                    • 1




                      Can you please explain why you think feature scaling is important? Without explanation this is more of a comment.
                      – kjetil b halvorsen
                      Nov 10 at 13:36








                    1




                    1




                    Can you please explain why you think feature scaling is important? Without explanation this is more of a comment.
                    – kjetil b halvorsen
                    Nov 10 at 13:36




                    Can you please explain why you think feature scaling is important? Without explanation this is more of a comment.
                    – kjetil b halvorsen
                    Nov 10 at 13:36


















                     

                    draft saved


                    draft discarded



















































                     


                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f376302%2fcan-i-use-rates-e-g-crude-rates-as-an-explanatory-variable-covariate-in-re%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Coverage of Google Street View

                    Full-time equivalent

                    Surfing